Soluble Fragments of the SARS-CoV Spike Glycoprotein

This application claims priority from U.S. Application Ser. No.
60/489,466 filed July 21, 2003 and from U.S. Application Ser. No. 60/524,642
filed November 25, 2003, which are hereby incorporated by reference in their
entireties.

Government Funding 
The invention described herein was developed with the support of the
Department of Health and Human Services. The United States Government has
certain rights in the invention.

Field of the Invention 
The invention relates generally to a spike polypeptide that is encoded by
a coronavirus (herein SARS-CoV), which is etiologically linked to Severe Acute
Respiratory Syndrome (SARS). The invention further relates to nucleic acids
and polypeptides having amino acid sequences that correspond to fragments of
the spike protein of SARS-CoV, and conservative variants thereof. The invention
also relates to use of these nucleic acids, polypeptides, variants, and fragments to
produce antibodies that recognize the spike protein of SARS-CoV, and for the
production of vaccines against SARS. Another aspect of the invention relates to
spike protein fragments for inhibiting fusion of SARS-CoV with animal
cells.

Background of the Invention 
Severe acute respiratory syndrome (SARS) is an infectious atypical
pneumonia that has recently been recognized in patients in 32 countries and
regions. The atypical pneumonia with unknown etiology was initially observed
in Guangdong Province, China. This observation was followed by reports from
Hong Kong, Vietnam, Singapore, Canada and Beijing of severe febrile
respiratory illness that spread to household members and health care workers.
This disease was later designated "severe acute respiratory syndrome (SARS)"
by the World Health Organization (WHO). As of May 19, 2003, a cumulative
total of 7,864 SARS cases had been reported to WHO from 29 countries. A total of
643 deaths (case-fatality proportion: 8.2%) were reported.

Researchers around the world have sequenced the genomes of SARS-causing
viruses from different regions of the globe. The viruses have been
classified as coronaviruses. Coronaviruses have been grouped into three
categories based on cross-reactivity of antibodies backed up by genetic data.
Two previously known human viruses fell into different groups than
SARS-CoV. The coronavirus that causes SARS does not fit into any of the previously
known clusters. Rather, it forms a new group by itself. Phylogenetic analysis of
the predicted viral proteins indicates that the virus does not closely resemble any
of the three previously known groups of coronaviruses. Most coronaviruses
cause either a respiratory or an enteric disease, which is also transmitted by the
faecal-oral route.

The incubation period for SARS is usually 2 to 7 days. Infection is
characterized by fever, non-productive cough, shortness of breath, and the
presence of minimal auscultatory findings with consolidation on chest
radiographs. Lymphopenia, leucopenia, thrombocytopenia, and elevated liver
enzymes and creatine kinase may also be present in most cases. Symptoms
relating to the gastrointestinal tract were also noticed in SARS patients.

Pathological studies of patients who died of SARS from Guangdong,
Hong Kong, Beijing and Singapore showed diffuse alveolar damage (DAD) in the
lung as the most notable feature. In those individuals with severe disease
resulting in death, scattered type II pneumocytes showed marked cytologic
changes that include multinucleation, cytomegaly, nucleomegaly, clearing of
nuclear chromatin, and prominent nucleoli. Although these changes were severe,
they were within the spectrum of epithelial changes seen in other cases of diffuse
alveolar damage. Morphologic changes that were identified included bronchial
epithelial denudation, loss of cilia, and squamous metaplasia. Other findings
included focal intraalveolar hemorrhage, hemophagocytosis, necrotic
inflammatory debris in small airways, organizing pneumonia or secondary
bacterial pneumonia.

The pathogenesis of this disorder remains to be determined. However,
the mechanism of acute lung injury could involve direct damage by the virus to
the alveolar wall by targeting either endothelial cells or epithelial cells.
Alternatively, the virus could infect inflammatory cells with the injury mediated
through cytokines, interleukins, or tumor necrosis factor-alpha. It is also
possible that the tissue damage in SARS is not directly related to viral infection
in tissues but is a secondary effect of cytokines or other factors induced by viral
infection proximal to but not within the lung tissue.

Pathologic evaluation of the fatal cases showed that hepatocytes
underwent fatty degeneration, cloudy swelling, apoptosis and dot necrosis, with
Kupffer cell proliferation and portal infiltrates of lymphocytes. There were
regional hemorrhages, vascular congestion and lymphocytic infiltration in
the gastrointestinal walls of the patients.

Due to the ability of SARS-CoV to be spread through an airborne route,
SARS-CoV presents a particular threat to the health of large populations of
people throughout the world. Accordingly, methods to immunize people before
infection, diagnose infection, immunize people during infection, and treat
persons infected with SARS-CoV are greatly needed.

Summary of the Invention
These and other needs are met by the invention described herein. The
invention provides polypeptides; peptide fragments; viral fusion inhibitors;
coupled proteins; immunopeptides; immune compositions; peptidomimetics;
nucleic acid segments; expression cassettes; nucleic acid constructs; recombinant
viruses; viral vaccines; peptide vaccines; microorganism vaccines; DNA
vaccines; antibodies; aptamers; pharmaceutical compositions; methods to
immunize an animal; a method to treat severe acute respiratory syndrome
(SARS); methods to diagnose SARS; and kits.

The invention provides polypeptides having an amino acid sequence
corresponding to that of a polypeptide that is etiologically linked to SARS.
Preferably the polypeptide is the spike protein from SARS-CoV that can inhibit
SARS-CoV fusion with animal cells and/or raise an immune response in an animal. In
some embodiments, the polypeptide is a soluble form of the spike protein from
SARS-CoV. In other embodiments, the polypeptide includes amino acids
17-757 of the spike protein from SARS-CoV. In some embodiments, the
polypeptide includes amino acids 762-1189 of the spike protein from
SARS-CoV. In other embodiments, the polypeptide includes amino acids 17-757 of the
spike protein from SARS-CoV. In some embodiments, the polypeptide includes
amino acids 17-276 of the spike protein from SARS-CoV. In other
embodiments, the polypeptide includes amino acids 303-537 of the spike protein
from SARS-CoV. In some embodiments, the polypeptide includes amino acids
317-517 of the spike protein from SARS-CoV. In other embodiments, the
polypeptide includes amino acids 272-537 of the spike protein from SARS-CoV.
In some embodiments, the polypeptide includes amino acids 17-537 of the spike
protein from SARS-CoV. In other embodiments, the polypeptide includes
amino acids 17-1189 (relative to SEQ ID NO: 1) of the spike protein from
SARS-CoV. The polypeptides of the invention can inhibit SARS-CoV fusion
with animal cells. The nucleic acids and polypeptides of the invention can elicit
an immune response when used to inoculate an animal. In some embodiments,
the nucleic acids and polypeptides of the invention elicit a cellular immune
response when used to inoculate an animal. In other embodiments, the nucleic
acids and polypeptides of the invention elicit a humoral immune response when
used to inoculate an animal. The animal can be a reptile. In some embodiments,
the animal is an avian. In other embodiments, the animal is a mammal.
Sometimes, the animal is a human.

The invention provides peptide fragments of the spike protein from
SARS-CoV. Preferably the peptide fragments are soluble in aqueous solution.
A peptide fragment of the invention may lack one amino acid residue from the
amino acid sequence of the full-length spike protein from SARS-CoV. In some
embodiments, peptide fragments are at least three amino acids in length. In
other embodiments, peptide fragments are at least 10 amino acids in length. In
some embodiments, peptide fragments are at least 20 amino acids in length. In
other embodiments, peptide fragments are at least 30 amino acids in length. In
some embodiments, peptide fragments are at least 40 amino acids in length. In
other embodiments, peptide fragments are at least 50 amino acids in length. In
some embodiments, peptide fragments are at least 60 amino acids in length. The
peptide fragments may also be single amino acid unit additions to a fragment of
a given length. For example, a peptide fragment may be 3, 4, 10, 11, 21, 22, 31, or
32 amino acids in length. The peptide fragments of the invention can inhibit
SARS-CoV fusion with animal cells or elicit an immune response when used to
inoculate an animal. Examples of peptides that can elicit an immune response
after inoculation of an animal include, for example, the D24 peptide having
sequence DVQAPNYTQHTSSMRGC (SEQ ID NO:58) and the P540 peptide
having sequence PSSKRFQPQQFGRDC (SEQ ID NO:59). In some
embodiments, the peptide fragments of the invention elicit a cellular immune
response when used to inoculate an animal. In other embodiments, the peptide
fragments of the invention elicit a humoral immune response when used to
inoculate an animal. The animal can be a reptile. In some embodiments, the
animal is an avian. In other embodiments, the animal is a mammal. In further
embodiments, the animal is a human.

The invention provides coupled proteins. The coupled proteins include a
carrier protein that is coupled to a second polypeptide. Preferably, the carrier
protein is soluble. In some embodiments, the carrier protein increases an
immune response to the second polypeptide of the coupled protein when used to
inoculate an animal. In other embodiments, the carrier protein elicits a cellular
immune response to the second polypeptide of the coupled protein when used to
inoculate an animal. In some embodiments, the carrier protein elicits a humoral
immune response to the second polypeptide of the coupled protein when used to
inoculate an animal. The second polypeptide can be a polypeptide or a peptide
fragment of the invention, or a conservative variant thereof. The animal can be a
reptile. In some embodiments, the animal is an avian. In other embodiments,
the animal is a mammal. In further embodiments, the animal is a human.

The invention provides immunopeptides that include a polypeptide or
peptide fragment of the invention, or a conservative variant thereof, that is
coupled to an acetyl group, a picryl group, an arsanilic acid, or to a sulfanilic
acid. In some embodiments, the immunopeptide is coupled to an acetyl or a
picryl group. In other embodiments, the immunopeptide is coupled to arsanilic acid
or sulfanilic acid. Preferably, the immunopeptide is soluble. Preferably, the
immunopeptide elicits an immune response when used to inoculate an animal.
In some embodiments, the immunopeptide elicits a humoral immune response
when used to inoculate an animal. In other embodiments, the immunopeptide
elicits a cellular immune response when used to inoculate an animal. The animal
can be a reptile. In some embodiments, the animal is an avian. In other
embodiments, the animal is a mammal. In further embodiments, the animal is a
human.

The invention provides peptidomimetics that are polypeptides or peptide
fragments of the invention, and conservative variants thereof, in which a peptide
bond has been replaced with a non-peptide bond. In some embodiments, the
peptidomimetic can inhibit SARS-CoV fusion with animal cells. In other
embodiments, the peptidomimetic elicits an immune response when used to
inoculate an animal. For example, the peptidomimetic can elicit a cellular
immune response when used to inoculate an animal. Alternatively, the
peptidomimetic elicits a humoral immune response when used to inoculate an
animal. The animal can be a reptile. In some embodiments, the animal is an
avian. In other embodiments, the animal is a mammal. In further embodiments,
the animal is a human.

The invention provides compositions containing an adjuvant and a
nucleic acid, polypeptide, a peptide fragment, or a peptidomimetic of the
invention. In some embodiments, the composition inhibits SARS-CoV fusion
with animal cells. In other embodiments, the composition elicits an immune
response when used to inoculate an animal. In some embodiments, the immune
composition elicits a cellular immune response when used to inoculate an
animal. In other embodiments, the immune composition elicits a humoral
immune response when used to inoculate an animal. The animal can be a reptile.
In some embodiments, the animal is an avian. In other embodiments, the animal
is a mammal. In further embodiments, the animal is a human.

The invention provides nucleic acid segments that encode polypeptides
and peptide fragments of the invention, and conservative variants thereof.

The invention provides expression cassettes having a promoter that is
operably linked to a nucleic acid segment of the invention. In some
embodiments, the promoter is constitutive. In other embodiments, the promoter
is inducible.

The invention provides nucleic acid constructs that include a vector and a
nucleic acid segment of the invention. The nucleic acid construct can include an
expression cassette of the invention. In some embodiments, the vector can be a
virus. In other embodiments, the vector is a plasmid. In further embodiments,
the vector is an expression vector.

The invention provides a recombinant virus that includes a viral vector
and a nucleic acid segment of the invention. In some embodiments, the viral
vector is a herpes virus. In other embodiments, the viral vector is a canarypox
virus. In other embodiments, the viral vector is an adenovirus. In further
embodiments, the viral vector is a vaccinia virus.

The invention provides a viral vaccine against SARS that includes a viral
vector, a nucleic acid segment of the invention, and a pharmaceutical carrier. In
some embodiments, the viral vector is a herpes virus. In other embodiments, the
viral vector is a canarypox virus. In other embodiments, the viral vector is an
adenovirus. In further embodiments, the viral vector is a vaccinia virus.
Preferably, the pharmaceutical carrier is formulated for injection. Preferably, the
viral vaccine elicits an immune response when used to inoculate an animal. In
some embodiments, the viral vaccine elicits a cellular immune response when
used to inoculate an animal. In other embodiments, the viral vaccine elicits a
humoral immune response when used to inoculate an animal. The animal can be
a reptile. In some embodiments, the animal is an avian. In other embodiments,
the animal is a mammal. In further embodiments, the animal is a human.

The invention provides a peptide vaccine against SARS that includes a
peptidomimetic, polypeptide or a peptide fragment of the invention, or a
conservative variant thereof, and a pharmaceutical carrier. Preferably, the
pharmaceutical carrier is formulated for injection. Preferably, the peptide
vaccine is formulated in unit dosage form. Preferably, the peptide vaccine elicits
an immune response when used to inoculate an animal. In some embodiments,
the peptide vaccine elicits a cellular immune response when used to inoculate an
animal. In other embodiments, the peptide vaccine elicits a humoral immune
response when used to inoculate an animal. The animal can be a reptile. In
some embodiments, the animal is an avian. In other embodiments, the animal is
a mammal. In further embodiments, the animal is a human.

The invention provides a microorganism vaccine against SARS that
includes a microorganism that expresses a polypeptide or a peptide fragment of
the invention, or a conservative variant thereof, and a pharmaceutical carrier.
Preferably, the microorganism is attenuated. In some embodiments, the
microorganism is Salmonella. In other embodiments, the microorganism is
Listeria. In further embodiments, the microorganism is Listeria monocytogenes.
In some embodiments, the pharmaceutical carrier is formulated for injection. In
other embodiments, the pharmaceutical carrier is formulated for oral
administration. Preferably, the microorganism vaccine is formulated in unit
dosage form. Preferably, the microorganism vaccine elicits an immune response
when used to inoculate an animal. In some embodiments, the microorganism
vaccine elicits a cellular immune response when used to inoculate an animal. In
other embodiments, the microorganism vaccine elicits a humoral immune
response when used to inoculate an animal. The animal can be a reptile. In
some embodiments, the animal is an avian. In other embodiments, the animal is
a mammal. In further embodiments, the animal is a human.

The invention provides a DNA vaccine against SARS that includes a
vector into which is inserted a nucleic acid segment of the invention, and a
pharmaceutical carrier. The DNA vaccine may include an adjuvant. The DNA
vaccine may include a myonecrotic agent. For example, the myonecrotic agent
can be bupivacaine. In other embodiments, the myonecrotic agent is cardiotoxin.
The vector can, for example, be a virus. In other embodiments, the vector is a
bacteriophage. In further embodiments, the vector is a plasmid. The vector
containing the insert can be prepared in a eukaryotic cell. However, in some
embodiments, the vector containing the insert is prepared in a prokaryotic cell.
For example, the vector containing the insert can be prepared in a bacterium. In
some embodiments, the pharmaceutical carrier is formulated for mucosal
delivery. In other embodiments, the pharmaceutical carrier is formulated for
injection. Preferably, the DNA vaccine is formulated in unit dosage form.
Preferably, the DNA vaccine elicits an immune response when used to inoculate
an animal. In some embodiments, the DNA vaccine elicits a humoral immune
response when used to inoculate an animal. In other embodiments, the DNA
vaccine elicits a cellular immune response when used to inoculate an animal.
The animal can be a reptile. In some embodiments, the animal is an avian. In
other embodiments, the animal is a mammal. In further embodiments, the
animal is a human.

The invention provides an antibody that binds to a polypeptide or peptide
fragment of the invention, or a conservative variant thereof. In some
embodiments, the antibody is an antigen-binding antibody fragment. In other
embodiments, the antibody is a polyclonal antibody. In further embodiments,
the antibody is a single-chain antibody. In other embodiments, the antibody is a
monoclonal antibody. In some preferred embodiments, the antibody is a
humanized antibody. The antibody may be coupled to a detectable tag. For
example, the detectable tag can be a radiolabel. In some embodiments, the
detectable tag is an affinity tag. In other embodiments, the detectable tag is an
enzyme. In further embodiments, the detectable tag is a fluorescent protein. In
some preferred embodiments, the detectable tag is a fluorescent marker. The
antibody may also be coupled to a toxin.

The invention provides aptamers that bind to a polypeptide or peptide
fragment of the invention, or a conservative variant thereof. The aptamer may
be coupled to a detectable tag. For example, the detectable tag is a radiolabel.
In some embodiments, the detectable tag is an affinity tag. In other
embodiments, the detectable tag is an enzyme. In further embodiments, the
detectable tag is a fluorescent protein. In some preferred embodiments, the
detectable tag is a fluorescent marker. The aptamer may also be coupled to a
toxin.

The invention provides a pharmaceutical composition or a kit containing
an antibody, S polypeptide or aptamer of the invention and a pharmaceutical
carrier. Preferably, the pharmaceutical composition is formulated for injection.

Brief Description of the Figures 
This patent or application file contains at least one drawing executed in
color. Copies of this patent or patent application publication with color
drawing(s) will be provided by the Office upon request and payment of the
necessary fee.

Fig. 1A illustrates an agarose gel electrophoresis of a DNA construct
having an insert that encodes the spike protein of the invention. Lanes from left
to right: Lane 1 is a one kb DNA ladder (markers from bottom to top - 0.5, 1.0,
1.6, 2.0, 3.0, 4.0); Lane 2 shows the DNA construct digested with BamHI/XbaI,
resulting in the distinctive vector band (upper band) and the DNA fragment that
encodes the spike protein (lower band); Lane 3 shows the DNA construct
digested with HindIII, which produced a smaller band and a larger band as
expected due to the presence of a HindIII site in the vector and within the DNA
fragment encoding the spike protein.

Fig. 1B provides a schematic diagram of a monomer of the full-length
SARS-CoV S glycoprotein showing various soluble polypeptide fragments after
removal of the signal sequence (residues 1-16, SEQ ID NO:60). The soluble
fragments are spike protein fragments named "S" followed by numbers
corresponding to the spike protein amino acids that constitute the termini of the
fragment. Thus, "S756" is a soluble spike protein fragment beginning at amino
acid 17 (just after the signal sequence) and ending at amino acid 756. "TM"
denotes the transmembrane segment and the arrow indicates a possible cleavage
site within amino acid positions 758-761 (sequence RNTR). "RBD" indicates
the potential receptor-binding domain that is within amino acid positions
272-537 (SEQ ID NO:57), likely between a residue downstream from position 303
and a residue upstream of position 537 (SEQ ID NO:61).
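
The fragment naming convention described for Fig. 1B can be sketched in Python as follows. This is only an illustration: it assumes 1-based, inclusive residue numbering, the helper names fragment_bounds and extract_fragment are hypothetical, and the toy sequence is a placeholder rather than SEQ ID NO:1.

```python
from typing import Tuple

# Residues 1-16 form the signal sequence, so single-number fragments start at 17.
SIGNAL_END = 16


def fragment_bounds(name: str) -> Tuple[int, int]:
    """Map a name such as 'S756' or 'S317-517' to (start, end), 1-based inclusive."""
    if not name.startswith("S"):
        raise ValueError("fragment names are expected to start with 'S'")
    body = name[1:]
    if "-" in body:
        # Two numbers: explicit start and end residues, e.g. S317-517.
        start_text, end_text = body.split("-", 1)
        start, end = int(start_text), int(end_text)
    else:
        # One number: fragment begins just after the signal sequence, e.g. S756.
        start, end = SIGNAL_END + 1, int(body)
    if not 1 <= start <= end:
        raise ValueError("invalid residue range in " + name)
    return start, end


def extract_fragment(full_length: str, name: str) -> str:
    """Slice the named fragment out of a full-length sequence string."""
    start, end = fragment_bounds(name)
    if end > len(full_length):
        raise ValueError("fragment extends past the end of the sequence")
    return full_length[start - 1:end]


# Placeholder sequence of 800 residues (hypothetical, for illustration only).
toy = "".join("ACDEFGHIKLMNPQRSTVWY"[i % 20] for i in range(800))
```

Under these assumptions, "S756" maps to residues 17 through 756 and "S317-517" to residues 317 through 517.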

Fig. 2 illustrates a denaturing polyacrylamide gel electrophoresis
(SDS-PAGE) of the expression of a peptide fragment of the spike protein from
SARS-CoV in Escherichia coli. The peptide fragment corresponds to amino acids
17-446 of SEQ ID NO: 1. The nucleic acid segment encoding amino acids 17-446
was cloned into a pRSET vector to create pRSET-S(17-446), which was
expressed in BL21DE3 cells. Numbers and arrows on the left indicate molecular
weight markers in kilodaltons. The lanes contain the following polypeptides: M
- molecular weight markers; lanes 1 and 2 - polypeptides of control E. coli
containing the pRSET vector without the nucleic acid segment encoding amino
acid residues 17-446 of SEQ ID NO: 1 and without isopropylthiogalactoside
(IPTG) induction; lane 3 - polypeptides of control E. coli containing the pRSET
vector without the nucleic acid segment encoding amino acid residues 17-446 of
SEQ ID NO: 1 but with IPTG induction; lane 4 - analysis of E. coli containing
the pRSET vector with a nucleic acid segment encoding amino acid residues
17-446 of SEQ ID NO: 1, and with IPTG induction. The arrow on the right side
indicates the position of a peptide fragment corresponding to amino acid residues
17-446 of SEQ ID NO: 1 as expressed in E. coli.

Fig. 3 illustrates a slot blot analysis of the expression of the indicated
peptide fragments of the spike protein from SARS-CoV in mammalian cells.
Nucleic acid segments coding for the peptide fragments were cloned into a
pSecTag2B vector to express peptide fragments having the mouse κ chain leader
sequence at the N-terminus for secretion, and a c-Myc epitope plus a histidine
tag at the C-terminus for detection and affinity purification. The nucleic acid
constructs were transformed into HEK293 and VeroE6 cells. Expression of the
indicated peptide fragments was examined through use of slot blot analysis with
an anti-c-Myc antibody. The numbers on the left and right indicate the amino
acid residues included within the detected peptide fragments. The left column
represents expression of the peptide fragments in HEK293 cells. The right
column represents expression of the peptide fragments in VeroE6 cells. The
upper half represents samples obtained from medium in which the cells were
grown (secreted proteins), and the lower half represents samples obtained from
cell lysate (intracellular portion). PC is a positive control, provided by the
manufacturer of the plasmid, that contains PSA with a c-Myc tag at the
C-terminus. NC is a negative control that contains the full-length spike protein
from SARS-CoV that lacks a c-Myc epitope or histidine tag.

Fig. 4A illustrates a slot blot analysis of the expression of the indicated
peptide fragments from the spike protein from SARS-CoV in human 293 or
monkey VeroE6 cells. Supernatants of 293 and Vero E6 cells transfected with
plasmids encoding S fragments (S276, S537, and S756) in the absence or
presence of T7 polymerase expressed by recombinant vaccinia virus (VTF7.3)
were transferred to nitrocellulose membranes and detected with anti-c-Myc
epitope antibody. The numbers on the left and right indicate the amino acid
residues included within the detected peptide fragments. PSA PC is a positive
control that contains PSA with a c-Myc tag at the C-terminus. pCDNA-S NC is
a negative control that contains the full-length spike protein from SARS-CoV
that lacks a c-Myc epitope or histidine tag. The lanes are as follows: (1) human
293 cells that were not infected with a VTF7.3 vaccinia virus, (2) human 293
cells that were infected with a VTF7.3 vaccinia virus, (3) monkey VeroE6 cells
that were not infected with a VTF7.3 vaccinia virus, and (4) monkey VeroE6
cells that were infected with a VTF7.3 vaccinia virus.

In Fig. 4B, supernatants from transfected cells as described above for Fig.
4A were incubated with Ni-NTA agarose beads, washed, and subjected to
Western blotting with the same anti-c-Myc epitope antibody as in Fig. 4A.

Fig. 4C illustrates detection of S fragments by two rabbit polyclonal
antibodies raised against peptides corresponding to sequences starting at residues
24 (D24, middle panel) and 540 (P540, right panel), respectively. The left panel
shows, for comparison, a Western blot in which S537 and S756 were detected by the
anti-c-Myc epitope antibody.

Fig. 5 illustrates that the full-length membrane-associated S protein is
expressed on the surface of cells, as shown by flow cytometry using the rabbit
polyclonal antibody P540. A nucleic acid encoding the full-length S
glycoprotein was used to transfect 293 cells, which were then infected with
VTF7.3. Cells were collected and incubated with P540 polyclonal antibody plus
anti-rabbit secondary antibody conjugated with FITC, washed, and subjected to
flow cytometry analysis. The same plasmid used to express S but without the
nucleic acids for S was used to transfect cells in a control experiment denoted as
negative control (NC); cells with nucleic acids encoding the full-length S
glycoprotein are denoted as S.

Figs. 6A and 6B illustrate that substantially no cleavage of the S
glycoprotein occurs naturally. Western blots of supernatants from transfected
293 cells expressing S756, Se, and cell lysate of 293 cells expressing the S
glycoproteins using the P540 antibody are shown. Close to background level
cleavage of S and Se was observed. Fig. 6A shows a Western blot of samples
kept for three days at 4 °C before analysis to monitor the effect of nonspecific
protease activity on the cleavage pattern. In contrast, Fig. 6B shows blots with
samples used immediately after preparation.

Fig. 7A-C shows that cell fusion is mediated by the S glycoprotein. A
pCDNA3-based plasmid without S insert was used as a plasmid control, and
fusion between S-expressing cells and ACE2-ecto-expressing cells was used as a
negative control. The pCDNA3-ACE2-ecto construct expresses just the ACE2
soluble ecto domain tagged with C9 peptide. Fig. 7A illustrates that there was no
syncytium formation between 293T cells transfected with pSecTag2B-S and
pCDNA3-ACE2-Ecto. In contrast, Fig. 7B illustrates syncytium formation
between 293T cells transfected with pSecTag2B-S and pCDNA3-ACE2,
respectively. Fig. 7C graphically illustrates cell fusion as measured by a reporter
gene-based assay. As shown, S glycoprotein expressed in both pCDNA3 and
pSecTag2B vectors can be detected in a β-gal reporter gene-based cell-cell
fusion assay.

Fig. 8A-C shows that the S glycoprotein receptor-binding domain (RBD)
is localized between residues 272 and 537. Fig. 8A illustrates binding of two
different S soluble fragments (S537 and S756) to 293 and Vero E6 cells. Fig. 8B
illustrates binding of various S fragments to Vero E6 cells. The background
OD405 measured for the negative control was subtracted from the OD405 values
of each S fragment. The resulting OD405 for each fragment was then presented as
a percentage of the OD405 for S537. Fig. 8C illustrates which S polypeptide
fragments interact with purified soluble ACE2 as measured by ELISA. In all
experiments, the negative control (NC) represents sample processed exactly the
same way as the others except that the plasmid used for transfection did not
encode any protein. Data shown here represent at least three independent
experiments. OD405 for all samples is presented as percentages of the OD405 for
S537.
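
The normalization described for Fig. 8 (background subtraction followed by expression as a percentage of the S537 signal) can be sketched in Python as follows. The function name normalize_od405 is hypothetical and the readings are invented placeholders, not data from the figures.

```python
def normalize_od405(readings, negative_control="NC", reference="S537"):
    """Subtract the negative-control OD405, then express each sample as % of the reference."""
    background = readings[negative_control]
    reference_signal = readings[reference] - background
    if reference_signal <= 0:
        raise ValueError("reference signal must exceed background")
    return {
        name: 100.0 * (od - background) / reference_signal
        for name, od in readings.items()
        if name != negative_control
    }


# Invented placeholder readings (not measured values).
example_readings = {"NC": 0.10, "S537": 1.10, "S756": 0.85, "S276": 0.12}
percent_of_s537 = normalize_od405(example_readings)
```

By construction, the reference fragment S537 is reported as 100 percent and a sample reading at background level is reported as 0 percent.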

Fig. 9A-D illustrates that dimerization occurs between the N terminal
fragments of the SARS-CoV S glycoprotein as demonstrated by
co-immunoprecipitation and cross-linking. All N-terminal fragments except the
smallest fragment (S317-517) containing the receptor binding domain were
coimmunoprecipitated with S756 by the P540 antibody. The P540 antibody is a
rabbit polyclonal antibody that was developed against a peptide containing
residues 540-555 of the S glycoprotein and it binds the S756 polypeptide but not
the N-terminal fragments.

In Fig. 9A, plasmids encoding N-terminal fragments (denoted by the
number of the ending amino acid residue or the number of the starting and
ending residue) were used to transfect 293T cells alone (left six lanes) or in
combination (right four lanes) with a S756-encoding plasmid. These cells were
then infected with the vaccinia virus VTF7.3. After incubation, the culture
medium was collected and subjected to Western blot analysis using a mouse
anti-c-Myc epitope antibody that recognizes all fragments.

Fig. 9B shows that all N-terminal S fragments, except the smallest
fragment (S317-517) that contained the receptor binding domain, were
coimmunoprecipitated with S756 by the P540 antibody. The same medium
samples used in Fig. 9A were subjected first to immunoprecipitation with the
P540 polyclonal antibody that recognizes only S756. These immunoprecipitates
were then subjected to Western blot analysis using the anti-c-Myc epitope
antibody to confirm that the N-terminal fragments coimmunoprecipitated.

Fig. 9C shows that a new band with a molecular weight corresponding to
a dimer forms in the presence and absence of DTT. To rule out the possibility of
nonspecific disulfide bond formation that may lead to coimmunoprecipitation,
DTT was included in one of the coimmunoprecipitation experiments. DTT had
no effect on either immunoprecipitation or coimmunoprecipitation of secreted
S756 (left lanes) or S756+S276 (right lanes). Medium samples containing
secreted S756 (left lanes) or S756+S276 (right lanes) fragments were subjected
to immunoprecipitation with P540 in the presence or absence of 2 mM DTT.

Fig. 9D illustrates the size of the S polypeptide oligomers. The S537
fragment was cross-linked with BS3 (Pierce, Rockford, IL) as described in the
Examples. A Western blot was prepared after SDS-PAGE separation, and the
anti-c-Myc antibody was used for detection of the S537 monomer and its
oligomers. As shown in the right lane of Fig. 9D, a new band appears when the
crosslinking reagent is added. The new band had a molecular weight
corresponding to a dimer but not to higher order oligomers.

Fig. 10A illustrates dimerization of the N-terminal fragment S537 as
detected by size-exclusion chromatography. The elution profiles of S537 and
S317-517 are shown with arrows and numbers indicating the position and
molecular weight at which standard calibration proteins were eluted.

Fig. 10B provides Western blots of fractions collected for S537 and
S317-517 by using an anti-c-Myc epitope antibody.

Fig. 11A-B illustrates that the extreme N-terminal domain is required for
S glycoprotein-mediated cell-cell fusion. Fig. 11A provides a schematic
representation of the S glycoprotein deletion mutants and a summary of the data
from a cell-cell fusion assay, where RBD denotes the approximate position of the
receptor binding domain. The presence of signal due to fusion is denoted by a
plus (+) and the lack of measurable signal above background levels by a minus (-).
Only wild type polypeptides with amino acids 17-1255 had fusion activity.
Neither of the deletion mutants having amino acids 103-1255 (Del1) or 311-1255
(Del2) had fusion activity. Fig. 11B shows the levels of expression of the full
length and deletion mutants of the S glycoprotein as measured by Western
analysis. Equal amounts of cell lysates were loaded for each sample and the rabbit
polyclonal antibody P540 was used for detection. Fig. 11C illustrates that the
full length S glycoprotein and the Del1 and Del2 deletion mutants are expressed
on the cell surface as measured by flow cytometry. The level of surface
expression was low, although the negative control, in which the cells were
transfected with an empty plasmid, was clearly distinguishable to the left of the
other three curves.

Fig. 12A-B illustrates that dimeric S1 binds more efficiently to the
receptor ACE2 than monovalent fragments containing the receptor binding
domain. Fig. 12A shows the relative levels of expression of different S
fragments as detected by ELISA using 200 µl of culture supernatants from cells
transfected with S276, S319-518 and S537 constructs. Anti-His and anti-c-Myc
epitope antibodies were used in a sandwich ELISA to detect the levels of
secreted tagged S proteins. Fig. 12B shows the level of binding by S fragments
to ACE2 as measured by ELISA. The tagged ACE2 was bound to plates by an
anti-C9 antibody that had been previously coated on the plates. The supernatants
from cell cultures where the cells were transfected with various S proteins were
mixed and incubated in ELISA plates either with (hatched bars) or without (open
bars) anti-c-Myc antibody. The highest level of expression or binding is assumed
to be 100%. As shown, the S537 fragment, with both the N-terminal
dimerization domain and the receptor binding domain, binds ACE2 more
efficiently than does the S319-518 fragment that has only the receptor binding
domain.

Fig. 13A-B illustrates that the soluble S ectodomain is trimeric under the
conditions of size-exclusion chromatography. In Fig. 13A, purified Se was run
on a gel filtration column that was calibrated by using proteins with known
molecular weight. BSA in equal amount was included as an internal control. In
Fig. 13B, different fractions were collected from the gel filtration column and
analyzed by Western blot. Two S polypeptide bands are detected in some
fractions that contain Se fragments of the indicated molecular weights,
representing the Se fragment alone (lower band) and its aggregates (upper band).

Fig. 14A illustrates that a DNA vaccine of the invention can elicit very
high titer anti-SARS-CoV sera in mice. Mice 1A-5A were immunized with
DNA encoding the S319-518 fragment that contains the spike protein receptor
binding domain (RBD). Mice 1B-5B were immunized with RBD-encoding
DNA (the S319-518 fragment) fused to a nucleic acid encoding an Fc fragment.
Mice 1C-3C received plasmid only (no S fragment DNA). Anti-sera were
collected and tested via ELISA to ascertain the titer of the different isolates. In
Fig. 14A, the first number denotes an individual mouse, the letter denotes the
respective immunization group, and the last number denotes the dilution used.
Anti-sera were diluted by factors of 50, 250, 1250 and 7250, as shown on the
x-axis of the bar graph. These data indicate that immunization with DNA
encoding the receptor binding domain of the S protein induces a strong immune
response against SARS-CoV.

Fig. 14B illustrates that anti-sera from mice immunized with
RBD-encoding DNA can prevent S-mediated cell fusion. Cells (293T) were incubated
with anti-sera from mice immunized with DNA encoding a spike protein
receptor binding domain polypeptide (S319-518) fragment and then the cell
suspension was mixed with cells expressing S protein. Fusion was measured as
described in Example 20 (see also Xiao et al., BBRC 2003). The percentage
(where 1=100%) of activity for each fusion reaction is plotted on the y-axis,
where the percentage of the fusion without any inhibition was designated as
100%. PC denotes the positive control, where no serum was added. For mice sera #1
to #2 in each group, serum dilution factors of 10 (designated 0.1), 100
(designated 0.01), and 1000 (designated 0.001) were used. For mice sera #3-#5
in groups A and B, and #3 in the control group, dilution factors of 20 (designated
0.05) and 100 (designated 0.01) were used. These data indicate that
immunization with DNA encoding the receptor binding domain of the S protein
could prevent SARS-CoV infection.
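The labeling convention used in Fig. 14B (a dilution factor designated by its reciprocal, and fusion activity expressed relative to the uninhibited control, where 1 = 100%) can be sketched as follows. The function names and the signal values are hypothetical and serve only to illustrate the arithmetic.

```python
def dilution_designation(dilution_factor):
    """Return the reciprocal label used for a serum dilution factor,
    e.g. a 10-fold dilution is designated 0.1."""
    if dilution_factor <= 0:
        raise ValueError("dilution factor must be positive")
    return 1 / dilution_factor


def relative_fusion(signal, uninhibited_signal):
    """Express a fusion signal as a fraction of the uninhibited
    (positive control) signal, where 1 corresponds to 100%."""
    return signal / uninhibited_signal


# Dilution factors named in the description of Fig. 14B.
labels = {factor: dilution_designation(factor) for factor in (10, 20, 100, 1000)}

# Hypothetical fusion signals, for illustration only.
fraction_remaining = relative_fusion(signal=0.3, uninhibited_signal=1.2)
```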

Fig. 15 illustrates that soluble S glycoprotein fragments inhibit
S-mediated cell fusion. Various S fragments at 10 µg/ml were first incubated with
ACE2-expressing cells for 10 min at room temperature. The ACE2-expressing
cells were then mixed with S-expressing cells and the fusion assay
was carried out as described in the Examples. The y-axis is the OD595 for each
sample after the background noise was subtracted. The numbers for each construct
represent the starting and ending residues of the respective polypeptide.

Detailed Description of the Invention 
SARS represents an important public health concern. Methods to
diagnose and treat persons who are infected with SARS-CoV provide the
opportunity to either prevent or control further spread of infection by
SARS-CoV. These methods are especially important due to the ability of SARS-CoV
to infect persons through an airborne route. The present invention provides
nucleic acids that encode segments of the amino acid sequence of the spike
protein of SARS-CoV. The present invention also provides polypeptides that
correspond in amino acid sequence to segments of the amino acid sequence of
the spike protein of SARS-CoV. The invention also provides peptide fragments
and conservative variants of the spike protein of SARS-CoV, in addition to
coupled proteins and peptidomimetics that have portions which correspond in
amino acid sequence to the spike protein.

The spike protein is important because it is present on the outside of
intact SARS-CoV. Thus, it presents a target that can be used to inhibit or
eliminate an intact virus before the virus has an opportunity to infect a cell.

The nucleic acids and polypeptides of the invention offer advantages
over the full length spike protein because the nucleic acids are easy to produce
and the polypeptides of the invention are produced in large amounts in soluble
form. The polypeptides of the invention offer additional advantages over the
native spike protein because they can be made to have increased resistance to
degradation when administered to an animal. The polypeptides of the invention
can also be formulated to increase their antigenicity to make them more efficient
antigens to elicit an immune response when administered to an animal, such as a
human.

Accordingly, the invention provides nucleic acids and polypeptide
antigens that may be used to formulate vaccines and immune compositions that
can be used to immunize and treat persons who are infected with SARS-CoV. In
addition, the invention provides antibodies that bind to the spike protein of
SARS-CoV which may be used to diagnose, immunize, and treat persons
infected with SARS-CoV.

Definitions: 

An "adjuvant" is generally defined as a substance that nonspecifically
enhances the immune response to an antigen. A variety of adjuvants may be
employed with the immunopeptides and immunofiragopeptides of this invention.
Most adjuvants contain a substance designed to protect the antigen from rapid
catabolism, such as aluminum hydroxide or mineral oil, and a stimulator of
immune responses, such as lipid A, Bordetella pertussis or Mycobacterium
tuberculosis derived proteins. Suitable adjuvants are commercially available as,
for example, Freund's Incomplete Adjuvant and Complete Adjuvant (Difco
Laboratories, Detroit, Mich.); Merck Adjuvant 65 (Merck and Company, Inc.,
Rahway, N.J.); aluminum salts such as aluminum hydroxide gel (alum) or
aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of
acylated tyrosine; acylated sugars; cationically or anionically derivatized
polysaccharides; polyphosphazenes; biodegradable microspheres;
monophosphoryl lipid A and quil A. Cytokines, such as GM-CSF or
interleukin-2, -7, or -12, may also be used as adjuvants.

An "animal" refers to an organism that can mount an immune response
upon antigenic challenge. For example, reptiles, avians, and mammals are able
to produce antibodies in response to an antigenic challenge. Antibodies raised in
non-human organisms are thought to be useful in diagnostic assays to reduce or
eliminate cross-reactivity.

An "aptamer" is a peptide, polypeptide or nucleic acid (RNA or DNA)
that binds to a polypeptide or peptide fragment of the invention.

A "carrier protein" refers to a polypeptide that can be coupled with a
polypeptide or a peptide fragment of the invention to form a coupled protein. A
carrier protein may be coupled to a polypeptide or peptide fragment in order to
increase the solubility or the immunogenicity of the polypeptide or peptide
fragment. A carrier protein may also be coupled to a polypeptide or peptide
fragment to provide a tag which provides for separation or detection of the
coupled protein. For example, biotin may be used as a carrier protein that is
coupled to a polypeptide or peptide fragment to create a coupled protein which
can then be isolated through interaction with avidin, or detected through use of a
fluorescently tagged avidin. In another example, a carrier protein that is bound
by an antibody can be coupled to a polypeptide or peptide fragment to create a
coupled protein that is bound by the antibody which binds to the carrier protein
of the coupled protein.

The invention encompasses isolated or substantially purified nucleic
acids, peptides, polypeptides or proteins. In the context of the present invention,
an "isolated" nucleic acid, DNA or RNA molecule or an "isolated" polypeptide
is a nucleic acid, DNA molecule, RNA molecule, or polypeptide that exists apart
from its native environment and is therefore not a product of nature. An isolated
nucleic acid, DNA molecule, RNA molecule or polypeptide may exist in a
purified form or may exist in a non-native environment such as, for example, a
transgenic host cell. A "purified" nucleic acid molecule, peptide, polypeptide or
protein, or a fragment thereof, is substantially free of other cellular material, or
culture medium when produced by recombinant techniques, or substantially free
of chemical precursors or other chemicals when chemically synthesized. In one
embodiment, an "isolated" nucleic acid is free of sequences that naturally flank
the nucleic acid (i.e., sequences located at the 5' and 3' ends of the nucleic acid)
in the genomic DNA of the organism from which the nucleic acid is derived.
For example, in various embodiments, the isolated nucleic acid molecule can
contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb, or 0.1 kb of
nucleotide sequences that naturally flank the nucleic acid molecule in genomic
DNA of the cell from which the nucleic acid is derived. A protein, peptide or
polypeptide that is substantially free of cellular material includes preparations of
protein, peptide or polypeptide having less than about 30%, 20%, 10%, or 5%
(by dry weight) of contaminating protein. When the protein of the invention, or
biologically active portion thereof, is recombinantly produced, preferably culture
medium represents less than about 30%, 20%, 10%, or 5% (by dry weight) of
chemical precursors or non-protein-of-interest chemicals.

The terms polypeptide, peptide and protein are used interchangeably
herein.

A peptide or polypeptide "fragment" as used herein refers to a less than
full length peptide, polypeptide or protein. For example, a peptide or
polypeptide fragment can be at least about 3, at least about 4, at least about
5, at least about 10, at least about 20, at least about 30, or at least about 40 amino
acids in length, or single unit lengths thereof. For example, a fragment may be 6,
7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, or more amino acids in length. There is no
upper limit to the size of a peptide fragment. However, in some embodiments,
peptide fragments can be less than about 500 amino acids, less than about 400
amino acids, less than about 300 amino acids or less than about 250 amino acids
in length. Preferably the peptide fragment can elicit an immune response when
used to inoculate an animal. A peptide fragment may be used to elicit an
immune response by inoculating an animal with a peptide fragment in
combination with an adjuvant, a peptide fragment that is coupled to an adjuvant,
or a peptide fragment that is coupled to arsanilic acid, sulfanilic acid, an acetyl
group, or a picryl group. A peptide fragment can include a non-amide bond, and
can be a peptidomimetic.

The term "soluble" as used herein refers to the ability of a polypeptide to
be solvated in an aqueous solution. For example, a soluble peptide can be mixed
with an aqueous medium such that at least a detectable portion of the peptide is
present in the aqueous medium. The peptide may be detected through use of
common techniques, such as absorbance of light, fluorescence, the ability to bind
dyes, the ability to reduce silver ions, and the like.

The term "specifically binds" refers to an antibody that binds to a single
epitope, but which does not bind to more than one epitope. Accordingly, an
antibody that specifically binds to a polypeptide will bind to an epitope that is
present on the polypeptide, but which is not present on other polypeptides.

I. Polypeptides, peptide fragments, coupled proteins, immunopeptides, and
peptidomimetics of the invention

The invention provides a polypeptide which has an amino acid sequence
that corresponds to the amino acid sequence of the spike protein from the virus
(SARS-CoV) that is etiologically linked to severe acute respiratory syndrome
(SARS). A representative amino acid sequence is provided by SEQ ID NO: 1,
whose sequence is provided below for easy reference.



   1 MFIFLLPLTL TSGSDLDRCT TFDDVQAPNY TQHTSSMRGV
  41 YYPDEIPRSD TLYLTQDLFL PFYSNVTGFH TINHTFGNPV
  81 IPFKDGIYFA ATEKSNVVRG WVFGSTMNNK SQSVIIINNS
 121 TNVVIRACNF ELCDNPFFAV SKPMGTQTHT MIFDNAFNCT
 161 FEYISDAFSL DVSEKSGNFK HLREFVFKNK DGFLYVYKGY
 201 QPIDVVRDLP SGFNTLKPIF KLPLGINITN FRAILTAFSP
 241 AQDIWGTSAA AYFVGYLKPT TFMLKYDENG TITDAVDCSQ
 281 WPLAELKCSV KSFEIDKGIY QTSNFRVVPS GDVVRFPNIT
 321 NLCPFGEVFN ATKFPSVYAW ERKKISNCVA DYSVLYNSTF
 361 FSTFKCYGVS ATKLNDLCFS NVYADSFVVK GDDVRQIAPG
 401 QTGVIADYNY KLPDDFMGCV LAWNTRNIDA TSTGNYNYKY
 441 RYLRHGKLRP FERDISNVPF SPDGKPCTPP ALNCYWPLND
 481 YGPYTTTGIG YQPYRVVVLS FELLNAPATV CGPKLSTDLI
 521 KNQCVNFNFN GLTGTGVLTP SSKRFQPFQQ PGRDVSDFTD






wo 2005/010034 



PCT/US2004/023345 



 561 SVRDPKTSEI [illegible] VSVTTPGTNA [illegible]
 601 VNCTDVSTAI HADQLTPAWR [illegible] TOAGCLIGAE
 641 HVDTSYECDI [illegible] [illegible] [illegible]
 681 [illegible] NNTTATPTNF STSTTTEVMP VSMAKTSVDG
 721 NMYTPGDSTE [illegible] [illegible] GTAAFODRNT
 761 REVFAQVKQM YKTPTLKYFG GPNFSOILPD PLKPTKRSFT
 801 EDLLFNKVTL ADAGFMKOYG [illegible] [illegible]
 841 TVLPPLLTDD MIAAYTAALV SGTATAGWTF [illegible]
 881 AMOMAYRPNG IGVTONVLYE [illegible] [illegible]
 921 [illegible] [illegible] [illegible] [illegible]
 961 [illegible] [illegible] [illegible] [illegible]
1001 RASANLAATK [illegible] RVDPGGKGYH [illegible]
1041 GVVFLHVTYV PSOERNFTTA PATCHEGKAY [illegible]
1081 GTSWFITORN PPSPOTTTTD [illegible] [illegible]
1121 DPLQPELDSF KEELDKYFKN HTSPDVDLGD ISGINASVVN
1161 IQKEIDRLNE VAKNLNESLI DLQELGKYEQ YIKWPWYVWL
1201 GFIAGLIAIV MVTILLCCMT SCCSCLKGAC SCGSCCKFDE
1241 DDSEPVLKGV KLHYT







The invention also provides peptide fragments which have amino acid
sequences that correspond to a fragment of the spike protein from the virus
(SARS-CoV) that is etiologically linked to severe acute respiratory syndrome
(SARS). Such amino acid sequences include those represented by SEQ ID NOs:
13, 14, 15, 20-59, and 61-63. The peptide fragments of SEQ ID NO: 1 can also
be three or more amino acids in length, and produce an immune response when
used to immunize an animal. These peptide fragments are exemplified by those
that are three amino acids in length, or single amino acid units of greater length,
such as 4, 5, 6, 7, 8, 9, 10 amino acids in length, and an amino acid sequence that
lacks one amino acid from the amino acid sequence corresponding to SEQ ID
NO: 1.
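The range of fragment lengths described above (three amino acids, each single-unit length greater than three, up to one residue less than the full sequence) can be illustrated by enumerating contiguous windows of a sequence. The sketch below is an illustration only: the function name is hypothetical, and the ten-residue string is simply the first block of the listing as printed, used as a short stand-in for SEQ ID NO: 1.

```python
def peptide_fragments(sequence, length):
    """Return (start_position, fragment) pairs for every contiguous
    fragment of the given length; positions are 1-based."""
    if length < 3:
        raise ValueError("fragments are three or more amino acids in length")
    if length >= len(sequence):
        raise ValueError("a fragment is shorter than the full sequence")
    return [
        (start + 1, sequence[start:start + length])
        for start in range(len(sequence) - length + 1)
    ]


# Short stand-in for SEQ ID NO: 1 (first ten residues as printed).
example = "MFIFLLPLTL"

three_mers = peptide_fragments(example, 3)
# Fragments lacking exactly one terminal residue of the stand-in sequence.
one_short = peptide_fragments(example, len(example) - 1)
```

A sequence of n residues yields n - k + 1 contiguous fragments of length k, so the stand-in gives eight three-residue fragments and two nine-residue fragments.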

The invention also provides coupled proteins having a carrier protein
coupled to a polypeptide or peptide fragment of the invention. The carrier
protein may be used to increase the solubility of the coupled protein. The carrier
protein may also be used to increase the immunogenicity of the coupled protein
to increase production of antibodies that bind to the polypeptide or peptide
fragment of the invention. The carrier protein may also be used to provide for
the separation or detection of a coupled protein. Accordingly, a coupled protein
can be detected or isolated by interaction with other components that bind to the
carrier protein portion of the coupled protein. For example, a coupled protein
having avidin as a carrier protein can be detected or separated with biotin
through use of known methods. Numerous carrier proteins may be used to
create coupled proteins of the invention. Examples of such carrier proteins
include keyhole limpet hemocyanin, bovine serum albumin, ovalbumin, mouse
serum albumin, rabbit serum albumin, and the like. A carrier protein may be
coupled to a polypeptide or peptide fragment of the invention by creation of a
fusion protein through use of recombinant methods. A carrier protein may also
be coupled to a polypeptide or peptide fragment of the invention through use of
chemical linking methods, or through use of a chemical linker. Such coupling
methods are known in the art and have been described. Harlow et al.,
Antibodies: A Laboratory Manual, page 319 (Cold Spring Harbor Pub. 1988);
Taylor, Protein Immobilization, Marcel Dekker, Inc., New York, (1991).

The invention provides immunopeptides having a polypeptide or a
peptide fragment of the invention coupled to arsanilic acid, sulfanilic acid, an
acetyl group, or a picryl group. Methods to couple such groups to peptides are
known and have been reported. Weigle, J. Exp. Med., 116:913-928 (1962);
Weigle, J. Exp. Med., 122:1049-1062 (1965); Weigle, J. Exp. Med.,
121:289-308 (1965).

The polypeptides and peptide fragments of the invention may be in
glycosylated form, or in unglycosylated form. A polypeptide or peptide
fragment of the invention may be soluble or insoluble in aqueous solution. The
polypeptides and peptide fragments of the invention may be conservative
variants. A conservative variant is a polypeptide or peptide fragment derived
from a full-length polypeptide, such as that exemplified by SEQ ID NO: 1, by
deletion (so-called truncation), addition, or subtraction of one or more amino
acids to the N-terminal and/or C-terminal end of the full-length polypeptide; or by
deletion, addition or subtraction of one or more amino acids at one or more sites
in the full-length polypeptide. Such variants may result from, for example,
genetic polymorphism or from human manipulation. Methods for such
manipulations are generally known in the art. For example, amino acid sequence
variants of SEQ ID NO: 1 can be prepared by mutagenesis of DNA encoding the
polypeptide. Methods for mutagenesis and nucleotide sequence alterations are
well known in the art. See, for example, Kunkel, Proc. Natl. Acad. Sci. USA,
82, 488 (1985); Kunkel et al., Methods in Enzymol., 154:367 (1987); U.S.
Patent No. 4,873,192; Walker and Gaastra, eds., Techniques in Molecular
Biology, MacMillan Publishing Company, New York (1983) and the references
cited therein. Guidance as to appropriate amino acid substitutions may be found
in the model of Dayhoff et al., Atlas of Protein Sequence and Structure, Natl.
Biomed. Res. Found., Washington, D.C. (1978), herein incorporated by
reference. Conservative substitutions, such as exchanging one amino acid with
another having similar properties, are preferred. Examples include substitution of a
hydrophobic amino acid for another, or substitution of a hydrophilic amino acid
for another. Routine screening assays can be used to determine if a substituted
polypeptide or peptide fragment derived from SEQ ID NO: 1 produces an
immune response when administered to a mammal. Examples of such screening
assays are well known in the art and include enzyme linked immunosorbent
assays, radioimmunoassays, chromium release assays, and the like. Such assays
have been described. Harlow et al., Antibodies: A Laboratory Manual, page 319
(Cold Spring Harbor Pub. 1988).
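The notion of a conservative substitution (exchanging a hydrophobic amino acid for another hydrophobic one, or a hydrophilic amino acid for another hydrophilic one) can be sketched as a simple class comparison. The two-class grouping below is an assumption made for illustration; it is one simplified convention among several and is not a grouping defined in this description. All function names are hypothetical.

```python
# Assumed, simplified two-class grouping of the 20 standard residues
# (one-letter codes). Other conventions classify some residues differently.
HYDROPHOBIC = set("AVLIMFWPGC")
HYDROPHILIC = set("STYNQDEKRH")


def residue_class(residue):
    """Return the assumed class of a one-letter amino acid code."""
    if residue in HYDROPHOBIC:
        return "hydrophobic"
    if residue in HYDROPHILIC:
        return "hydrophilic"
    raise ValueError(f"unknown residue: {residue!r}")


def is_conservative(original, substitute):
    """True when both residues fall in the same assumed class."""
    return residue_class(original) == residue_class(substitute)


def conservative_variants(peptide, position):
    """List variants of a peptide in which the residue at the 1-based
    position is replaced by a different residue of the same class."""
    original = peptide[position - 1]
    candidates = HYDROPHOBIC if original in HYDROPHOBIC else HYDROPHILIC
    return [
        peptide[:position - 1] + substitute + peptide[position:]
        for substitute in sorted(candidates - {original})
    ]


variants = conservative_variants("NLC", position=2)
```

Whether any such variant still produces an immune response would be determined by the screening assays mentioned above, not by the classification itself.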

The invention provides peptidomimetics of the polypeptides and peptide
fragments of the invention. A peptidomimetic describes a peptide analog, such
as those commonly used in the pharmaceutical industry as non-peptide drugs,
with properties analogous to those of the template peptide (Fauchere, J., Adv.
Drug Res., 15:29 (1986) and Evans et al., J. Med. Chem., 30:1229 (1987)).
Peptidomimetics are structurally similar to polypeptides or peptide fragments
having peptide bonds, but have one or more peptide linkages optionally replaced
by a linkage such as -CH2NH-, -CH2S-, -CH2-CH2-, -CH=CH- (cis and
trans), -COCH2-, -CH(OH)CH2-, and -CH2SO-, by methods known in the
art. Advantages of peptide mimetics over natural polypeptide embodiments may
include more economical production, greater chemical stability, altered
specificity and enhanced pharmacological properties such as half-life,
absorption, potency and efficacy.

The polypeptides, peptide fragments, coupled proteins, and
peptidomimetics of the invention can be modified for in vivo use by the addition,
at the amino-terminus and/or the carboxyl-terminus, of a blocking agent to
decrease degradation in vivo. This can be useful in those situations in which the
polypeptide termini tend to be degraded by proteases prior to cellular uptake.
Such blocking agents can include, without limitation, additional related or
unrelated peptide sequences that can be attached to the amino and/or carboxyl
terminal residues of the polypeptide, peptide fragment, coupled protein, and
peptidomimetic to be administered. This can be done either chemically during
the synthesis of the polypeptide, peptide fragment, or coupled protein, or by
recombinant DNA technology by methods familiar to artisans of average skill.
Alternatively, blocking agents such as pyroglutamic acid, or other molecules
known in the art, can be attached to the amino and/or carboxyl terminal residues,
or the amino group at the amino terminus or carboxyl group at the carboxyl
terminus can be replaced with a different moiety. Accordingly, the invention
provides polypeptides and peptide fragments that are amino-terminally and
carboxyl-terminally blocked.

The ability of a polypeptide or peptide fragment of the invention to
produce an immune response may be tested through numerous art recognized
methods. For example, the polypeptides or peptide fragments may be tested for
their ability to induce antibody production, or to stimulate a cytotoxic
T-lymphocyte response.

The polypeptides and peptide fragments of the invention may be used
within screening assays to identify or isolate antibodies that bind to the
polypeptides or peptide fragments of the invention, or the spike protein from
SARS-CoV. For example, the polypeptides or peptide fragments may be used in
phage display assays to isolate antibodies that bind to the polypeptides or peptide
fragments. In another example, the polypeptides or peptide fragments of the
invention may be bound to a solid support to which antibodies are contacted
such that antibodies which bind to the polypeptides or peptide fragments become
immobilized on the solid support. These antibodies can be later eluted from the
solid support. The polypeptides and peptide fragments of the invention may be
used to isolate antibodies according to many other methods known in the art.

Expression systems that may be used for small or large scale production
of the coupled proteins, polypeptides or peptide fragments of the invention
include, but are not limited to, cells or microorganisms that are transformed with
a recombinant nucleic acid construct that contains a nucleic acid segment of the
invention. Examples of recombinant nucleic acid constructs may include
bacteriophage DNA, plasmid DNA, cosmid DNA, or viral expression vectors.
Examples of cells and microorganisms that may be transformed include bacteria
(for example, E. coli or B. subtilis); yeast (for example, Saccharomyces and
Pichia); insect cell systems (for example, baculovirus); plant cell systems; or
mammalian cell systems (for example, COS, CHO, BHK, 293, VERO, HeLa,
MDCK, WI38, and NIH 3T3 cells). Also useful as host cells are primary or
secondary cells obtained directly from a mammal that are transfected with a
plasmid vector or infected with a viral vector. Examples of suitable expression
vectors include, without limitation, plasmids and viral vectors such as herpes
viruses, retroviruses, vaccinia viruses, attenuated vaccinia viruses, canary pox
viruses, adenoviruses, adeno-associated viruses and lentiviruses,
among others. Synthetic methods may also be used to produce polypeptides and
peptide fragments of the invention. Such methods are known and have been
reported. Merrifield, Science, 85:2149 (1963).

II. Nucleic acid segments, expression cassettes, and nucleic acid constructs
of the invention

The present invention provides isolated nucleic acid segments that
encode the polypeptides, peptide fragments, and coupled proteins of the
invention. The nucleic acid segments of the invention also include segments that
encode for the same amino acids due to the degeneracy of the genetic code. For
example, the amino acid threonine is encoded by ACU, ACC, ACA and ACG
and is therefore degenerate. It is intended that the invention includes all
variations of the polynucleotide segments that encode for the same amino acids.
Such mutations are known in the art (Watson et al., Molecular Biology of the
Gene, Benjamin Cummings 1987). Mutations also include alteration of a nucleic
acid segment to encode for conservative amino acid changes, for example, the
substitution of leucine for isoleucine and so forth. Such mutations are also
known in the art. Thus, the genes and nucleotide sequences of the invention
include both the naturally occurring sequences as well as mutant forms.
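The degeneracy described above can be illustrated by enumerating every RNA sequence that encodes a given short peptide. The sketch below is an illustration only: the threonine codons are those named in the text, the remaining entries are taken from the standard genetic code, the table is deliberately partial, and the function name and the example peptide are hypothetical.

```python
from itertools import product

# Partial RNA codon table (one-letter amino acid code -> codons).
# Threonine codons are those named in the text; the other entries
# follow the standard genetic code. Only a few residues are included.
CODONS = {
    "M": ["AUG"],
    "T": ["ACU", "ACC", "ACA", "ACG"],
    "F": ["UUU", "UUC"],
    "K": ["AAA", "AAG"],
    "W": ["UGG"],
}


def synonymous_sequences(peptide):
    """Return every RNA sequence that encodes the peptide, using the
    partial table above (raises KeyError for residues not listed)."""
    choices = [CODONS[residue] for residue in peptide]
    return ["".join(codons) for codons in product(*choices)]


# Hypothetical three-residue peptide: 1 x 4 x 2 = 8 encodings.
encodings = synonymous_sequences("MTF")
```

The number of synonymous sequences is the product of the codon counts for each residue, which grows rapidly with peptide length.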

The nucleic acid segments of the invention may be contained within a
vector. A vector may include, but is not limited to, any plasmid, phagemid,
F-factor, virus, cosmid, or phage in double or single stranded linear or circular
form which may or may not be self transmissible or mobilizable. The vector can
also transform a prokaryotic or eukaryotic host either by integration into the
cellular genome or by existing extra-chromosomally (e.g. an autonomously
replicating plasmid with an origin of replication).

Preferably the nucleic acid segment in the vector is under the control of,
and operably linked to, an appropriate promoter or other regulatory elements for
transcription in vitro or in a host cell, such as a eukaryotic cell, or a microbe, e.g.
bacteria. The vector may be a shuttle vector that functions in multiple hosts. The
vector may also be a cloning vector that typically contains one or a small number
of restriction endonuclease recognition sites at which foreign DNA sequences
can be inserted in a determinable fashion. Such insertion can occur without loss
of essential biological function of the cloning vector. A cloning vector may also
contain a marker gene that is suitable for use in the identification and selection
of cells transformed with the cloning vector. Examples of marker genes are
tetracycline resistance or ampicillin resistance. Many cloning vectors are
commercially available (Stratagene, New England Biolabs, Clontech).
20 The nucleic acid segments of the invention may also be inserted into an 

expression vector. Typically an expression vector contains prokaryotic DNA 
elements coding for a bacterial replication origin and an antibiotic resistance 
gene to provide for the amplification and selection of the expression vector in a 
bacterial host; regulatory elements that control initiation of transcription such as 
25 a promoter; and DNA elements that control the processing of transcripts such as 
introns, or a transcription termination / polyadenylation sequence. 

Methods to introduce a nucleic acid segment into a vector are available in
the art (Sambrook et al., Molecular Cloning: A Laboratory Manual, 3rd edition,
Cold Spring Harbor Press, Cold Spring Harbor, N.Y. (2001)). Briefly, a vector
into which a nucleic acid segment is to be inserted is treated with one or more
restriction enzymes (restriction endonucleases) to produce a linearized vector
having a blunt end, a "sticky" end with a 5' or a 3' overhang, or any combination
of the above. The vector may also be treated with a restriction enzyme and
subsequently treated with another modifying enzyme, such as a polymerase, an
exonuclease, a phosphatase or a kinase, to create a linearized vector that has
characteristics useful for ligation of a nucleic acid segment into the vector. The
nucleic acid segment that is to be inserted into the vector is treated with one or
more restriction enzymes to create a linearized segment having a blunt end, a
"sticky" end with a 5' or a 3' overhang, or any combination of the above. The
nucleic acid segment may also be treated with a restriction enzyme and
subsequently treated with another DNA modifying enzyme, such as, but not
limited to, a polymerase, an exonuclease, a phosphatase or a kinase, to create a
nucleic acid segment that has characteristics useful for ligation into the vector.

The treated vector and nucleic acid segment are then ligated together to
form a construct containing a nucleic acid segment according to methods
available in the art (Sambrook et al., Molecular Cloning: A Laboratory Manual,
3rd edition, Cold Spring Harbor Press, Cold Spring Harbor, N.Y. (2001)).
Briefly, the treated nucleic acid fragment and the treated vector are combined in
the presence of a suitable buffer and ligase. The mixture is then incubated under
appropriate conditions to allow the ligase to ligate the nucleic acid fragment into
the vector.
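
The digestion and ligation steps outlined above may be pictured, purely as a sketch, with the following Python model. It tracks only the top strand of a linear molecule and assumes the enzyme is EcoRI, which cuts G^AATTC and leaves a 5' AATT overhang; the vector and insert sequences and the function names digest and ligate are hypothetical and serve only to show how compatible "sticky" ends regenerate the recognition site on ligation.

```python
ECORI_SITE = "GAATTC"  # EcoRI recognition sequence
CUT_OFFSET = 1         # top strand is cut after the first G (G^AATTC)


def digest(seq, site=ECORI_SITE, offset=CUT_OFFSET):
    """Cut the top strand of a linear sequence at every occurrence of site."""
    fragments = []
    start = 0
    pos = seq.find(site)
    while pos != -1:
        fragments.append(seq[start:pos + offset])
        start = pos + offset
        pos = seq.find(site, pos + 1)
    fragments.append(seq[start:])
    return fragments


def ligate(vector_left, insert, vector_right):
    """Join compatible ends (top strand only) into one construct."""
    return vector_left + insert + vector_right


# Hypothetical vector with a single EcoRI site.
vector_left, vector_right = digest("TTTTGAATTCAAAA")

# Hypothetical source DNA in which the segment of interest is flanked by
# two EcoRI sites; the middle fragment carries EcoRI ends on both sides.
_, insert, _ = digest("CCGAATTCATGGCCGAATTCGG")

construct = ligate(vector_left, insert, vector_right)
print(construct)
```

In this model both EcoRI sites are restored at the vector-insert junctions, which is why a segment cloned in this way can later be excised with the same enzyme.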

The invention also provides an expression cassette which contains a
nucleic acid sequence capable of directing expression of a particular nucleic acid
segment of the invention, such as SEQ ID NO: 2, either in vitro or in a host cell.
Also, a nucleic acid segment of the invention may be inserted into the expression
cassette such that an anti-sense message is produced. The expression cassette is
an isolatable unit such that the expression cassette may be in linear form and
functional for in vitro transcription and translation assays. The materials and
procedures to conduct these assays are commercially available from Promega
Corp. (Madison, Wisconsin). For example, an in vitro transcript may be
produced by placing a nucleic acid sequence under the control of a T7 promoter
and then using T7 RNA polymerase to produce an in vitro transcript. This
transcript may then be translated in vitro through use of a rabbit reticulocyte
lysate. Alternatively, the expression cassette can be incorporated into a vector
allowing for replication and amplification of the expression cassette within a
host cell or also in vitro transcription and translation of a nucleic acid segment.
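
As a minimal sketch of the sense and anti-sense orientations mentioned above, the following Python fragment models transcription of an insert placed downstream of a promoter in either orientation. The nine-nucleotide insert is a hypothetical placeholder (it is not SEQ ID NO: 2), and the function names are assumptions introduced for this illustration.

```python
# Translation table mapping each DNA base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence."""
    return dna.translate(COMPLEMENT)[::-1]


def transcribe(coding_strand):
    """The transcript matches the coding strand, with U in place of T."""
    return coding_strand.replace("T", "U")


# Hypothetical insert, read 5' to 3' on the coding strand.
sense_insert = "ATGGCCTAA"

# Insert cloned in the forward orientation: a sense message is produced.
sense_rna = transcribe(sense_insert)

# Insert cloned in the reverse orientation: the promoter now reads the
# opposite strand, so an anti-sense message is produced.
antisense_rna = transcribe(reverse_complement(sense_insert))

print(sense_rna, antisense_rna)
```

The anti-sense transcript is the reverse complement of the sense transcript and can therefore base-pair with it.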

Such an expression cassette may contain one or a plurality of restriction
sites allowing for placement of the nucleic acid segment under the regulation of
a regulatory sequence. The expression cassette can also contain a termination
signal operably linked to the nucleic acid segment as well as regulatory
sequences required for proper translation of the nucleic acid segment. The
expression cassette containing the nucleic acid segment may be chimeric,
meaning that at least one of its components is heterologous with respect to at
least one of its other components. The expression cassette may also be one
which is naturally occurring but has been obtained in a recombinant form useful
for heterologous expression. Expression of the nucleic acid segment in the
expression cassette may be under the control of a constitutive promoter or an
inducible promoter which initiates transcription only when the host cell is
exposed to some particular external stimulus.

The expression cassette may include, in the 5'-3' direction of transcription,
a transcriptional and translational initiation region, a nucleic acid segment and a
transcriptional and translational termination region functional in vivo and/or in
vitro. The termination region may be native with the transcriptional initiation
region, may be native with the nucleic acid segment, or may be derived from
another source.

The regulatory sequence can be a polynucleotide sequence located
upstream (5' non-coding sequences), within, or downstream (3' non-coding
sequences) of a coding sequence, and which influences the transcription, RNA
processing or stability, or translation of the associated coding sequence.
Regulatory sequences can include, but are not limited to, enhancers, promoters,
repressor binding sites, translation leader sequences, introns, and
polyadenylation signal sequences. They may include natural and synthetic
sequences as well as sequences which may be a combination of synthetic and
natural sequences. While regulatory sequences are not limited to promoters,
some useful regulatory sequences include constitutive promoters, inducible
promoters, regulated promoters, tissue-specific promoters, viral promoters and
synthetic promoters.

A promoter is a nucleotide sequence which controls the expression of the
coding sequence by providing the recognition for RNA polymerase and other
factors required for proper transcription. A promoter includes a minimal
promoter, consisting only of the basal elements needed for transcription
initiation, such as a TATA-box and/or an initiator, that is, a short DNA sequence
comprised of a TATA-box and other sequences that serve to specify the site of
transcription initiation, to which regulatory elements are added for control of
expression. A promoter may be derived entirely from a native gene, or be
composed of different elements derived from different promoters found in
nature, or even be comprised of synthetic DNA segments. A promoter may
contain DNA sequences that are involved in the binding of protein factors which
control the effectiveness of transcription initiation in response to physiological
or developmental conditions.

The invention also provides a construct containing a vector and an
expression cassette. The vector may be selected from, but not limited to, any
vector previously described. Into this vector may be inserted an expression
cassette through methods known in the art and previously described (Sambrook
et al., Molecular Cloning: A Laboratory Manual, 3rd edition, Cold Spring
Harbor Press, Cold Spring Harbor, N.Y. (2001)). In one embodiment, the
regulatory sequences of the expression cassette may be derived from a source
other than the vector into which the expression cassette is inserted. In another
embodiment, a construct containing a vector and an expression cassette is
formed upon insertion of a nucleic acid segment of the invention into a vector
that itself contains regulatory sequences. Thus, an expression cassette is formed
upon insertion of the nucleic acid segment into the vector. Vectors containing
regulatory sequences are available commercially and methods for their use are
known in the art (Clontech, Promega, Stratagene).

III. Immune compositions and vaccines of the invention

The invention provides immune compositions and vaccines that can be
used to produce an immune response against the virus that is etiologically linked
to severe acute respiratory syndrome when administered to an animal. The
immune response may be a humoral immune response or a cellular immune
response.

An immune composition of the invention can include an adjuvant and a
nucleic acid, polypeptide, peptide fragment, a peptidomimetic, a coupled protein,
an immunopeptide of the invention, or any combination thereof. An immune
composition can contain an adjuvant that is not chemically linked to a
polypeptide, peptide fragment, a peptidomimetic, a coupled protein, or an
immunopeptide of the invention. An immune composition can contain an
adjuvant that is chemically linked to a polypeptide, peptide fragment, a
peptidomimetic, a coupled protein, or an immunopeptide of the invention. An
immune composition of the invention can also include a pharmaceutically
acceptable diluent or carrier.

An immune composition may be manufactured conventionally. In
particular, a nucleic acid, polypeptide, peptide fragment, peptidomimetic,
coupled protein, immunopeptide, or any combination thereof that is contained in
the composition may be combined with a pharmaceutically acceptable diluent or
carrier. Examples of pharmaceutically acceptable diluents or carriers include
water or a saline solution, such as phosphate-buffered saline (PBS). In general,
the pharmaceutically acceptable diluent or carrier is selected on the basis of the
mode and route of administration and of standard pharmaceutical practices.
Pharmaceutically acceptable diluents and carriers as well as all that is necessary
for their use in pharmaceutical compositions are described in Remington's
Pharmaceutical Sciences, a standard reference text in this field.

Immune compositions may contain adjuvants as disclosed herein and as
known in the art. Aluminum compounds may be used as adjuvants. Such
aluminum compounds include aluminum hydroxide, aluminum phosphate,
aluminum hydroxyphosphate, and the like. The nucleic acid, polypeptide,
peptide fragment, peptidomimetic, coupled protein, immunopeptide, or any
combination thereof may be adsorbed or precipitated on an aluminum compound
according to standard methods. Other adjuvants include polyphosphazene (WO
95/2415), DC-chol (3-beta-[N-(N',N'-dimethylaminomethane)carbamoyl]
cholesterol) (U.S. Pat. No. 5,283,185 and WO 96/14831), QS-21 (WO 88/9336)
and RIBI from ImmunoChem (Hamilton, Montana). Immunostimulatory
oligonucleotides containing unmethylated CpG dinucleotides ("CpG") are
known in the art as being adjuvants when administered by both systemic and
mucosal routes (WO 96/02555; EP 468520; Davis et al., J. Immunol., 160:870
(1998); McCluskie and Davis, J. Immunol., 161:4463 (1998)). CpG, when
formulated into immune compositions or vaccines, is generally administered in
free solution together with free antigen (WO 96/02555; McCluskie and Davis, J.
Immunol., 161:4463 (1998)), or covalently conjugated to an antigen (PCT
Publication No. WO 98/16247), or formulated with a carrier such as aluminum
hydroxide (Brazolot-Millan et al., Proc. Natl. Acad. Sci., 95:15553 (1998)).

The invention also provides vaccines that include a nucleic acid,
polypeptide, a peptide fragment, a peptidomimetic, a coupled protein, an
immunopeptide of the invention, or any combination thereof. Such
vaccines can be formulated as described herein or as known in the vaccine arts.
For example, a viral vaccine may be created that expresses a polypeptide, a
peptide fragment, or a coupled protein of the invention according to methods
known in the art. Examples of viral vectors that may be used include
adenoviruses, herpes viruses, vaccinia viruses, canarypox viruses, and the like.
Vaccines can also be formulated as a liposome. Such formulations are known to
those skilled in the art (Liposomes: A Practical Approach, R.R.C. New, ed., IRL
Press (1990)).

The invention also provides nucleic acid based vaccines that express a
polypeptide, a peptide fragment, or a coupled protein of the invention. For
example, a nucleic acid vaccine can express a polypeptide having SEQ ID NO:
1, 13, 14, 15, 20-59, 61-63 or a fragment of SEQ ID NO: 1. Inoculation of an
animal with a nucleic acid construct that encodes a polypeptide, a peptide
fragment, or a coupled protein of the invention may lead to a humoral and
cell-mediated immune response to the encoded antigen. It is thought that some bone
marrow-derived professional antigen presenting cells are transfected by the
nucleic acid construct and the encoded antigen is transcribed and translated into
an immunogenic polypeptide that elicits specific responses. A feature of nucleic
acid vaccines is that they provide for eliciting strong cytotoxic T-lymphocyte
(CTL) responses. These responses occur because the nucleic acid-encoded
polypeptides are synthesized in the cytosol of transfected cells. Furthermore,
nucleic acid constructs that are produced in bacteria are rich in unmethylated
CpG nucleotides that are recognized as foreign by macrophages. Thus, they
elicit an innate immune response that enhances adaptive immunity. Therefore,
nucleic acid vaccines are effective even when administered without adjuvants.

Direct injection of an expression cassette into living host cells transforms
a number of the cells and causes them to express the introduced nucleic acid and
thereby express a gene product. The transfected cells may display fragments of
the expressed antigens on their cell surfaces together with major
histocompatibility class I (MHC I) or class II (MHC II) complexes.

Nucleic acid constructs can be introduced into cells more efficiently by
inducing muscle degeneration prior to the injection of the nucleic acid construct
into an animal, including a human (Vitadello et al., Hum. Gene Ther., 5:11
(1994); Danko and Wolff, Vaccine, 12:1499 (1994); Davis et al., Hum. Gene
Ther., 4:733 (1993)). For example, such a treatment is thought to increase the
efficiency of transfer by up to 40 fold. Two of the most commonly used
myonecrotic agents are the local anesthetic bupivacaine, and cardiotoxin (Danko
and Wolff, Vaccine, 12:1499 (1994); Davis et al., Hum. Gene Ther., 4:733
(1993)). A number of other techniques have been employed to transfer nucleic
acid constructs to muscle. Such other techniques include retroviral vectors,
adenoviral vectors, and liposomes. However, direct injection of naked nucleic
acid appears to be the most efficient of these delivery mechanisms at transferring
and expressing foreign nucleic acids in cells.

Nucleic acid constructs can be administered in a pharmaceutically
acceptable carrier. Pharmaceutically acceptable carriers are biologically
compatible vehicles which are suitable for administration to a human or other
mammalian subject, e.g., physiological saline. A therapeutically effective
amount is an amount of the nucleic acid construct that is capable of producing an
immune response (e.g., an enhanced T-cell response or antibody production) in a
treated animal. As is well known in the medical arts, the dosage for any one
patient depends upon many factors, including the patient's size, body surface
area, age, the particular compound to be administered, sex, time and route of
administration, general health, and other drugs being administered concurrently.
Dosages will vary, but a preferred dosage for administration of a nucleic acid
construct is from approximately 10^ to 10^^ copies of the nucleic acid construct.
This dose can be repeatedly administered, as needed.

Numerous routes of administration may be used to administer nucleic
acid constructs. Examples of such routes include intramuscular, intravenous,
intraperitoneal, intradermal, intranasal and subcutaneous injection;
administration of nucleic acid constructs by each of these routes has resulted in
immunization against influenza virus hemagglutinin (HA) in chickens (reviewed
in Pardoll and Beckerleg, Immunity, 3:165-169 (1995)). Nucleic acid based
vaccines can also be administered through use of a polymeric, biodegradable
microparticle or microcapsule delivery vehicle, sized to optimize phagocytosis by
phagocytic cells such as macrophages. For example, PLGA (poly-lacto-co-glycolide)
microparticles approximately 1-10 µm in diameter can be used. The nucleic acid
construct is encapsulated in these microparticles, which are taken up by
macrophages and gradually biodegraded within the cell, thereby releasing the
nucleic acid construct. Once released, the nucleic acid is expressed within the
cell. Another way to achieve uptake of a nucleic acid construct is through use of
liposomes. Such liposomes can be prepared by standard methods. The nucleic
acid constructs can be incorporated alone into these delivery vehicles or
co-incorporated with tissue-specific antibodies. Alternatively, a molecular
conjugate can be prepared that is composed of a nucleic acid construct attached
to poly-L-lysine by electrostatic or covalent forces. Poly-L-lysine binds to a
ligand that can bind to a receptor on target cells (Cristiano et al., J. Mol.
Med., 73:479 (1995)). Alternatively, lymphoid tissue specific targeting can be
achieved by the use of lymphoid tissue-specific transcriptional regulatory
elements (TRE) such as a B lymphocyte, T lymphocyte, or dendritic cell specific
TRE. Lymphoid tissue specific TRE are known (Thompson et al., Mol. Cell. Biol.,
12:1043 (1992); Todd et al., J. Exp. Med., 177:1663 (1993); Penix et al., J. Exp.
Med., 178:1483 (1993)).

The invention also provides microbe based vaccines. Generally, these
vaccines relate to microbes that have been transformed with a nucleic acid
construct that provides for the expression of a polypeptide, a peptide fragment,
or a coupled protein of the invention. For example, Listeria monocytogenes may
be used as a vector to elicit T-cell immunity. This is because it infects
antigen-presenting cells and also because infection originates at the mucosa
(Lieberman and Frankel, Vaccine, 20:2007-10 (2002)). Accordingly, Listeria may be
transformed with a nucleic acid construct that provides for the expression of a
polypeptide, a peptide fragment, or a coupled protein that elicits an immune
response against the spike protein from the coronavirus that causes severe acute
respiratory syndrome. Highly attenuated forms of Listeria may be constructed
according to methods reported in the art (Lieberman and Frankel, Vaccine,
20:2007 (2002)). Salmonella may also be used as a vector to elicit a cytotoxic T
lymphocyte (CTL) response against the coronavirus that causes severe acute
respiratory syndrome (Pasetti et al., Infect. Immun., 70:4009 (2002)).

An immune composition or vaccine may be administered by any
conventional route used in the field of vaccines. For example, an immune
composition or vaccine can be administered orally or by intravenous infusion, or
injected subcutaneously, intramuscularly, intraperitoneally, intrarectally,
intravaginally, intranasally, intragastrically, intratracheally, or intrapulmonarily.
The choice of the administration route depends on a number of parameters such
as the nature of the active principle; the identity of the polypeptide, peptide
fragment, peptidomimetic, coupled protein, immunopeptide, DNA vaccine; or
the adjuvant that is combined with the aforementioned molecules.
Administration of an immune composition may take place in a single dose or in
a dose repeated once or several times over a certain period. The appropriate
dosage varies according to various parameters. Such parameters include the
individual treated (adult or child), the immune composition or antigen itself, the
mode and frequency of administration, the presence or absence of adjuvant and,
if present, the type of adjuvant and the desired effect (e.g. protection or
treatment), as will be determined by persons skilled in the art.

IV. Antibodies and aptamers of the invention

The invention provides antibodies that bind to an amino acid sequence as
set forth in SEQ ID NO: 1, 13, 14, 15, 20-59, 60, 61, 62, 63 or a fragment of
SEQ ID NO: 1, or conservative variants thereof. Such antibodies are useful for
the diagnosis, immunization against, and treatment of severe acute respiratory
syndrome (SARS). In some embodiments, the antibody binds to a peptide
having SEQ ID NO:58 or 59. Antibodies that bind to the P540 peptide (SEQ ID
NO:59) are highly effective, and can detect spike polypeptides even after
extensive dilution. For example, a P540 antibody preparation diluted 1:10,000
could still detect spike polypeptides.

Antibodies can be prepared using an intact polypeptide or peptide
fragment of interest as the immunizing antigen. The polypeptide or fragment
used to immunize an animal can be derived from translated cDNA or chemical
synthesis. A polypeptide or peptide fragment can be coupled to a carrier protein,
if desired. Such commonly used carrier proteins which are chemically coupled
to the peptide include keyhole limpet hemocyanin (KLH), thyroglobulin, bovine
serum albumin (BSA), and tetanus toxoid. A coupled protein can be used to
immunize the animal (e.g., a mouse, a rat, or a rabbit).

If desired, polyclonal or monoclonal antibodies can be further purified,
for example, by binding to and elution from a matrix to which the polypeptide or
peptide fragment to which the antibodies were raised is bound. Those of skill in
the art will know of various techniques common in the immunology arts for
purification and/or concentration of polyclonal antibodies, as well as monoclonal
antibodies (Coligan et al., Unit 9, Current Protocols in Immunology, Wiley
Interscience, 1991, incorporated by reference).

It is also possible to use the anti-idiotype technology to produce
monoclonal antibodies which mimic an epitope. For example, an anti-idiotypic
monoclonal antibody made to a first monoclonal antibody will have a binding
domain in the hypervariable region which is the "image" of the epitope bound by
the first monoclonal antibody.

An antibody suitable for binding to a polypeptide or peptide fragment is
specific for at least one portion of a region of the polypeptide. For example, one
of skill in the art can use a peptide fragment to generate appropriate antibodies of
the invention. Antibodies of the invention include polyclonal antibodies,
monoclonal antibodies, and fragments of polyclonal and monoclonal antibodies.

The preparation of polyclonal antibodies is well-known to those skilled
in the art (Green et al., Production of Polyclonal Antisera, in Immunochemical
Protocols (Manson, ed.), pages 1-5 (Humana Press 1992); Coligan et al.,
Production of Polyclonal Antisera in Rabbits, Rats, Mice and Hamsters, in
Current Protocols in Immunology, section 2.4.1 (1992), which are hereby
incorporated by reference). For example, a polypeptide or peptide fragment is
injected into an animal host, preferably according to a predetermined schedule
incorporating one or more booster immunizations, and the animal is bled
periodically. Polyclonal antibodies specific for the polypeptide or peptide
fragment may then be purified from such antisera by, for example, affinity
chromatography using the polypeptide or peptide fragment coupled to a suitable
solid support.

The preparation of monoclonal antibodies likewise is conventional
(Kohler & Milstein, Nature, 256:495 (1975); Coligan et al., sections 2.5.1-2.6.7;
and Harlow et al., Antibodies: A Laboratory Manual, page 726 (Cold Spring
Harbor Pub, 1988)), which are hereby incorporated by reference. Briefly,
monoclonal antibodies can be obtained by injecting mice with a composition
comprising an antigen, verifying the presence of antibody production by
removing a serum sample, removing the spleen to obtain B lymphocytes, fusing
the B lymphocytes with myeloma cells to produce hybridomas, cloning the
hybridomas, selecting positive clones that produce antibodies to the antigen, and
isolating the antibodies from the hybridoma cultures. Monoclonal antibodies can
be isolated and purified from hybridoma cultures by a variety of well-established
techniques. Such isolation techniques include affinity chromatography with
Protein-A Sepharose, size-exclusion chromatography, and ion-exchange
chromatography (Coligan et al., sections 2.7.1-2.7.12 and sections 2.9.1-2.9.3;
Barnes et al., Purification of Immunoglobulin G (IgG), in Methods in Molecular
Biology, Vol. 10, pages 79-104 (Humana Press 1992)). Methods of in vitro and
in vivo multiplication of monoclonal antibodies are well-known to those skilled in
the art. Multiplication in vitro may be carried out in suitable culture media such
as Dulbecco's Modified Eagle Medium or RPMI 1640 medium, optionally
replenished by a mammalian serum such as fetal calf serum or trace elements
and growth-sustaining supplements such as normal mouse peritoneal exudate
cells, spleen cells, and bone marrow macrophages. Production in vitro provides
relatively pure antibody preparations and allows scale-up to yield large amounts
of the desired antibodies. Large scale hybridoma cultivation can be carried out
by homogenous suspension culture in an airlift reactor, in a continuous stirrer
reactor, or in immobilized or entrapped cell culture. Multiplication in vivo may be
carried out by injecting cell clones into mammals histocompatible with the
parent cells, e.g., syngeneic mice, to cause growth of antibody-producing
tumors. Optionally, the animals are primed with a hydrocarbon, especially oils
such as pristane (tetramethylpentadecane), prior to injection. After one to three
weeks, the desired monoclonal antibody is recovered from the body fluid of the
animal.

Antibodies can also be prepared through use of phage display techniques.
In one example, an organism is immunized with an antigen, such as a
polypeptide or peptide fragment of the invention. Lymphocytes are isolated
from the spleen of the immunized organism. Total RNA is isolated from the
splenocytes and mRNA contained within the total RNA is reverse transcribed
into complementary deoxyribonucleic acid (cDNA). The cDNA encoding the
variable regions of the light and heavy chains of the immunoglobulin is
amplified by polymerase chain reaction (PCR). To generate a single chain
fragment variable (scFv) antibody, the light and heavy chain amplification
products may be linked by splice overlap extension PCR to generate a complete
sequence and ligated into a suitable vector. E. coli are then transformed with the
vector encoding the scFv, and are infected with helper phage, to produce phage
particles that display the antibody on their surface. Alternatively, to generate a
complete antigen binding fragment (Fab), the heavy chain amplification product
can be fused with a nucleic acid sequence encoding a phage coat protein, and the
light chain amplification product can be cloned into a suitable vector. E. coli
expressing the heavy chain fused to a phage coat protein are transformed with
the vector encoding the light chain amplification product. The disulphide
linkage between the light and heavy chains is established in the periplasm of E.
coli. The result of this procedure is to produce an antibody library with up to 10^
clones. The size of the library can be increased to 10^^ phage by later addition of
the immune responses of additional immunized organisms that may be from the
same or different hosts.

Antibodies that recognize a specific antigen can be selected through panning.
Briefly, an entire antibody library can be exposed to an immobilized antigen
against which antibodies are desired. Phage that do not express an antibody that
binds to the antigen are washed away. Phage that express the desired antibodies
are immobilized on the antigen. These phage are then eluted and again amplified
in E. coli. This process can be repeated to enrich the population of phage that
express antibodies that specifically bind to the antigen. After phage are isolated
that express an antibody that binds to an antigen, a vector containing the coding
sequences for the antibody can be isolated from the phage particles and the
coding sequences can be recloned into a suitable vector to produce an antibody
in soluble form.

In another example, a human phage library can be used to select for antibodies,
such as monoclonal antibodies, that bind to the spike protein from SARS-CoV.
Briefly, splenocytes may be isolated from a human that is infected, or not
infected, with SARS-CoV and used to create a human phage library according to
methods as described above and known in the art. These methods may be used to
obtain human monoclonal antibodies that bind to the spike protein of SARS-CoV.
Phage display methods to isolate antigens and antibodies are known in the art
and have been described (Gram et al., Proc. Natl. Acad. Sci., 89:3576 (1992);
Kay et al., Phage display of peptides and proteins: A laboratory manual, San
Diego: Academic Press (1996); Kermani et al., Hybrid., 14:323 (1995); Schmitz
et al., Placenta, 21 Suppl. A:S106 (2000); Sanna et al., Proc. Natl. Acad. Sci.,
92:6439 (1995)).
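
The enrichment achieved by repeated rounds of panning can be illustrated with a simple numerical sketch in Python. The model below is an assumption made for illustration: each round retains a fixed fraction of binding phage and a much smaller fraction of non-binding phage, and amplification in E. coli is taken to preserve their proportions. The starting frequency and retention values are hypothetical and are not measurements from the invention.

```python
def pan(binder_fraction, keep_binder, keep_nonbinder, rounds):
    """Return the binder fraction before panning and after each round.

    binder_fraction: starting fraction of phage that bind the antigen
    keep_binder:     fraction of binding phage retained after washing
    keep_nonbinder:  fraction of non-binding phage retained (background)
    """
    history = [binder_fraction]
    f = binder_fraction
    for _ in range(rounds):
        bound = f * keep_binder
        background = (1.0 - f) * keep_nonbinder
        # Amplification is assumed to preserve the ratio of the two pools.
        f = bound / (bound + background)
        history.append(f)
    return history


# Hypothetical values: one binder per million phage, half of the binders
# retained per round, and one in a thousand non-binders carried over.
history = pan(1e-6, keep_binder=0.5, keep_nonbinder=0.001, rounds=3)
for round_number, fraction in enumerate(history):
    print(round_number, f"{fraction:.6f}")
```

Under these assumed values the ratio of binders to non-binders rises 500-fold per round, so a binder that is initially rare comes to dominate the population within a few rounds, which is why the process is repeated.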

An antibody of the invention may be derived from a "humanized"
monoclonal antibody. Humanized monoclonal antibodies are produced by
transferring mouse complementarity determining regions from heavy and light
variable chains of the mouse immunoglobulin into a human variable domain, and
then substituting human residues in the framework regions of the murine
counterparts. The use of antibody components derived from humanized
monoclonal antibodies obviates potential problems associated with the
immunogenicity of murine constant regions. General techniques for cloning
murine immunoglobulin variable domains are described (Orlandi et al., Proc.
Natl. Acad. Sci. USA, 86:3833 (1989), which is hereby incorporated in its
entirety by reference). Techniques for producing humanized monoclonal
antibodies are described (Jones et al., Nature, 321:522 (1986); Riechmann et al.,
Nature, 332:323 (1988); Verhoeyen et al., Science, 239:1534 (1988); Carter et
al., Proc. Natl. Acad. Sci. USA, 89:4285 (1992); Sandhu, Crit. Rev. Biotech.,
12:437 (1992); and Singer et al., J. Immunol., 150:2844 (1993), which are
hereby incorporated by reference).

In addition, antibodies of the present invention may be derived from a
human monoclonal antibody. Such antibodies are obtained from transgenic mice
that have been "engineered" to produce specific human antibodies in response to
antigenic challenge. In this technique, elements of the human heavy and light
chain loci are introduced into strains of mice derived from embryonic stem cell
lines that contain targeted disruptions of the endogenous heavy and light chain
loci. The transgenic mice can synthesize human antibodies specific for human
antigens, and the mice can be used to produce human antibody-secreting
hybridomas. Methods for obtaining human antibodies from transgenic mice are
described (Green et al., Nature Genet., 7:13 (1994); Lonberg et al., Nature,
368:856 (1994); and Taylor et al., Int. Immunol., 6:579 (1994), which are hereby
incorporated by reference).

Antibody fragments of the invention can be prepared by proteolytic
hydrolysis of the antibody or by expression in E. coli of DNA encoding the
fragment. Antibody fragments can be obtained by pepsin or papain digestion of
whole antibodies by conventional methods. For example, antibody fragments
can be produced by enzymatic cleavage of antibodies with pepsin to provide a
5S fragment denoted F(ab')2. This fragment can be further cleaved using a thiol
reducing agent, and optionally a blocking group for the sulfhydryl groups
resulting from cleavage of disulfide linkages, to produce 3.5S Fab' monovalent
fragments. Alternatively, an enzymatic cleavage using papain produces two
monovalent Fab fragments and an Fc fragment directly. These methods are
described (U.S. patents No. 4,036,945; 4,331,647; and 6,342,221, and references
contained therein; Porter, Biochem. J., 73:119 (1959); Edelman et al., Methods
in Enzymology, Vol. 1, page 422 (Academic Press 1967); and Coligan et al. at
sections 2.8.1-2.8.10 and 2.10.1-2.10.4).

Other methods of cleaving antibodies, such as separation of heavy chains
to form monovalent light-heavy chain fragments, further cleavage of fragments,
or other enzymatic, chemical, or genetic techniques may also be used, so long as
the fragments bind to the antigen that is recognized by the intact antibody.

For example, Fv fragments comprise an association of VH and VL
chains. This association may be noncovalent (Inbar et al., Proc. Natl. Acad. Sci.
USA, 69:2659 (1972)). Alternatively, the variable chains can be linked by an
intermolecular disulfide bond or cross-linked by chemicals such as
glutaraldehyde (Sandhu, Crit. Rev. Biotech., 12:437 (1992)). Preferably, the Fv
fragments comprise VH and VL chains connected by a peptide linker. These
single-chain antigen binding proteins (sFv) are prepared by constructing a
structural gene comprising DNA sequences encoding the VH and VL domains
connected by an oligonucleotide. The structural gene is inserted into an
expression vector, which is subsequently introduced into a host cell such as E.
coli. The recombinant host cells synthesize a single polypeptide chain with a
linker peptide bridging the two V domains. Methods for producing sFvs are
described (Whitlow et al., Methods: A Companion to Methods in Enzymology,
Vol. 2, page 97 (1991); Bird et al., Science, 242:423 (1988); Ladner et al., U.S.
patent No. 4,946,778; Pack et al., Bio/Technology, 11:1271 (1993); and Sandhu,
Crit. Rev. Biotech., 12:437 (1992)).

Another form of an antibody fragment is a peptide that forms a single
complementarity-determining region (CDR). CDR peptides ("minimal
recognition units") can be obtained by constructing genes encoding the CDR of
an antibody of interest. Such genes are prepared, for example, by using the
polymerase chain reaction to synthesize the variable region from RNA of
antibody-producing cells (Larrick et al., Methods: A Companion to Methods in
Enzymology, Vol. 2, page 106 (1991)).

An antibody of the invention may be coupled to a toxin. Such antibodies
may be used to treat animals, including humans, that are infected with the virus
that is etiologically linked to severe acute respiratory syndrome. For example,
an antibody that binds to the spike protein of the coronavirus that is etiologically
linked to severe acute respiratory syndrome may be coupled to a tetanus toxin
and administered to an animal suffering from infection by the aforementioned
virus. The toxin-coupled antibody is thought to bind to a portion of a spike
protein presented on an infected cell, and then kill the infected cell.

An antibody of the invention may be coupled to a detectable tag. Such
antibodies may be used within diagnostic assays to determine if an animal, such
as a human, is infected with SARS-CoV. Examples of detectable tags include
fluorescent proteins (e.g., green fluorescent protein, red fluorescent protein,
yellow fluorescent protein), fluorescent markers (e.g., fluorescein isothiocyanate,
rhodamine, Texas red), radiolabels (e.g., 3H, 125I), enzymes (e.g.,
β-galactosidase, horseradish peroxidase, β-glucuronidase, alkaline phosphatase), or
an affinity tag (e.g., avidin, biotin, streptavidin). Methods to couple antibodies to
a detectable tag are known in the art (Harlow et al., Antibodies: A Laboratory
Manual, page 319 (Cold Spring Harbor Pub. 1988)).

The invention also provides aptamers to the polypeptides and peptide
fragments of the invention. Aptamers of the invention can be peptide or nucleic
acid aptamers. Peptide aptamers are peptides that bind to a polypeptide or
peptide fragment of the invention with affinities that are often comparable to
those for monoclonal antibody-antigen complexes. Similarly, nucleic acid
aptamers are nucleic acids that bind to a polypeptide or peptide fragment of the
invention with strong affinities, for example, affinities that are often comparable
to those for monoclonal antibody-antigen complexes.

In one example, nucleic acid aptamers can be isolated through use of a
library of random oligonucleotide sequences. The library is screened against
immobilized S polypeptides or peptide fragments of the invention to
ascertain which oligonucleotides bind to them. The bound oligonucleotides are
eluted from the immobilized polypeptides or peptide fragments and are then
amplified by PCR. This process may be repeated to select for aptamers having
high affinity for the polypeptides and peptide fragments of the invention. The
sequence of the nucleic acid coding for the aptamers can then be determined and
cloned into a suitable vector to facilitate production and maintenance of the
desired aptamer.
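
The effect of repeating the selection cycle can be illustrated with a simple
enrichment calculation. The sketch below is a toy model and does not describe
any experiment in this application: the starting fraction of binding
oligonucleotides and the per-round retention probabilities are hypothetical
values, chosen only to show why a few rounds of binding, elution and PCR
amplification can enrich a rare binder.

```python
def enrich(binder_fraction, keep_binder, keep_nonbinder, rounds):
    """Return the binder fraction before selection and after each round.

    All parameters are hypothetical illustration values.
    keep_binder / keep_nonbinder are the chances that a binding or a
    non-binding oligonucleotide survives one bind-wash-elute cycle.
    PCR is assumed to copy both classes equally, so amplification does
    not change the fraction.
    """
    fractions = [binder_fraction]
    f = binder_fraction
    for _ in range(rounds):
        kept_binders = f * keep_binder
        kept_others = (1.0 - f) * keep_nonbinder
        f = kept_binders / (kept_binders + kept_others)
        fractions.append(f)
    return fractions


if __name__ == "__main__":
    # One binder per million sequences; binders are retained 500 times
    # more efficiently than non-binders (both numbers are assumptions).
    for n, f in enumerate(enrich(1e-6, 0.5, 0.001, rounds=4)):
        print(f"after round {n}: binder fraction = {f:.6f}")
```

Under these assumed numbers the odds in favor of binders grow by the same
factor each round, which is why the number of rounds, rather than the size of
the starting library, tends to dominate the outcome in this simplified picture.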

Peptide aptamers can be isolated by mRNA display of a library that
contains a promoter, a start codon, and a nucleic acid sequence that encodes
random peptides. In some embodiments, the DNA library also includes a nucleic
acid segment that codes for a histidine tag. This library is transcribed using a
suitable polymerase, such as T7 RNA polymerase, after which a
puromycin-containing poly A linker is ligated onto the 3' end of the newly
formed mRNAs. When these mRNAs are translated in vitro, the nascent peptides
form covalent bonds to the puromycin of the linker to form an mRNA-peptide
fusion molecule. The mRNA-peptide fusion molecules are then purified through
use of Ni-NTA agarose and oligo-dT-cellulose. The mRNA portion of the fusion
molecule is then reverse transcribed. The double-stranded DNA/RNA-peptide
fusion molecules are then incubated with an immobilized polypeptide or peptide
fragment of the invention and unbound fusion molecules are washed away. The
bound fusion molecules are eluted from the immobilized polypeptides or peptide
fragments and are then amplified by PCR. This process may be repeated to
select for aptamers having high affinity for the polypeptides and peptide
fragments of the invention. The sequence of the nucleic acid coding for the
aptamers can then be determined and cloned into a suitable vector. Methods for
the preparation of peptide aptamers have been described (Wilson et al., Proc.
Natl. Acad. Sci., 98:3750 (2001)). Accordingly, the invention provides aptamers
that recognize the polypeptides and peptide fragments of the invention.

V. Pharmaceutical compositions of the invention

The invention provides pharmaceutical compositions containing an
antibody that binds to an amino acid sequence as set forth in SEQ ID NO: 1, 13,
14, 15, 20-59, 60, 61, 62, 63 or a fragment of SEQ ID NO: 1, or a conservative
variant thereof, and a pharmaceutically acceptable carrier. In some
embodiments, the antibody binds to a peptide having SEQ ID NO:58 or 59.
Antibodies that bind to the P540 peptide (SEQ ID NO:59) are highly effective,
and can detect spike polypeptides even after extensive dilution. For example, a
P540 antibody preparation at a dilution of 1:10,000 could still detect spike
polypeptides.

The pharmaceutical compositions of the invention may be prepared in
many forms that include tablets, hard or soft gelatin capsules, aqueous solutions,
suspensions, and liposomes and other slow-release formulations, such as shaped
polymeric gels. An oral dosage form may be formulated such that the antibody
is released into the intestine after passing through the stomach. Such
formulations are described in U.S. Patent No. 6,306,434 and in the references
contained therein.

Oral liquid pharmaceutical compositions may be in the form of, for
example, aqueous or oily suspensions, solutions, emulsions, syrups or elixirs, or
may be presented as a dry product for constitution with water or other suitable
vehicle before use. Such liquid pharmaceutical compositions may contain
conventional additives such as suspending agents, emulsifying agents,
non-aqueous vehicles (which may include edible oils), or preservatives.

An antibody can be formulated for parenteral administration (e.g., by
injection, for example, bolus injection or continuous infusion) and may be
presented in unit dosage form in ampules, prefilled syringes, small volume
infusion containers or multi-dose containers with an added preservative. The
pharmaceutical compositions may take such forms as suspensions, solutions, or
emulsions in oily or aqueous vehicles, and may contain formulatory agents such
as suspending, stabilizing and/or dispersing agents. Pharmaceutical
compositions suitable for rectal administration can be prepared as unit dose
suppositories. Suitable carriers include saline solution and other materials
commonly used in the art.

For administration by inhalation, an antibody can be conveniently
delivered from an insufflator, nebulizer or a pressurized pack or other convenient
means of delivering an aerosol spray. Pressurized packs may comprise a suitable
propellant such as dichlorodifluoromethane, trichlorofluoromethane,
dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the case of a
pressurized aerosol, the dosage unit may be determined by providing a valve to
deliver a metered amount.

Alternatively, for administration by inhalation or insufflation, an
antibody may take the form of a dry powder composition, for example, a powder
mix of the antibody and a suitable powder base such as lactose or starch. The
powder composition may be presented in unit dosage form in, for example,
capsules or cartridges or, e.g., gelatin or blister packs from which the powder
may be administered with the aid of an inhalator or insufflator. For intra-nasal
administration, an antibody may be administered via a liquid spray, such as via a
plastic bottle atomizer.

Pharmaceutical compositions of the invention may also contain other
ingredients such as flavorings, colorings, anti-microbial agents, or preservatives.
It will be appreciated that the amount of an antibody required for use in
treatment will vary not only with the particular carrier selected but also with the
route of administration, the nature of the condition being treated and the age and
condition of the patient. Ultimately the attendant health care provider may
determine proper dosage. In addition, a pharmaceutical composition may be
formulated as a single unit dosage form.

VI. Method to immunize, treat, and diagnose an animal against severe acute 
respiratory syndrome 

The invention provides a method to immunize an animal against severe
acute respiratory syndrome. The method involves administering a
therapeutically effective amount of an antibody that binds to an amino acid
sequence as set forth in SEQ ID NO: 1, 13, 14, 15, 20-59, 60, 61, 62, 63 or a
fragment of SEQ ID NO: 1, or a conservative variant thereof to an animal;
administering an effective amount of an immune composition to an animal;
administering an effective amount of a viral vaccine to an animal; or
administering an effective amount of a nucleic acid vaccine to an animal. The
animal may be a mammal, such as a human. Methods to administer vaccines
and immune compositions have been described herein and are known in the art.

An animal may also be treated for infection by SARS-CoV through
passive immunization according to the invention. For example, antibodies that
bind to an amino acid sequence as set forth in SEQ ID NO: 1, 13, 14, 15, 20-55,
60, 61, 62, 63 or a fragment of SEQ ID NO: 1, or a conservative variant thereof
may be administered to an animal, such as a human, that is infected with
SARS-CoV. Such administration may be suitable in situations where a patient is
immune compromised and is unable to mount an effective immune response
against SARS-CoV, or to a vaccine or immune composition.

The invention provides a method to diagnose severe acute respiratory
syndrome in an animal that involves contacting a biological sample obtained
from the animal, such as tissue samples, blood, mucus, or saliva, with an
antibody that binds to an amino acid sequence as set forth in SEQ ID NO: 1, 13,
14, 15, 20-59, 60, 61, 62, 63 or a fragment of SEQ ID NO: 1, and determining if
the antibody binds to the biological sample. Diagnostic assays that utilize
antibodies to detect the presence of an antigen in a biological sample are well
known in the art. Briefly, an antibody of the invention may be immobilized on a
surface. A biological sample can then be contacted with the immobilized
antibody such that an antigen contained in the sample is bound by the antibody
to form an antibody-antigen complex. The sample may then be optionally
washed to remove unbound materials. A second antibody of the invention that is
coupled to a detectable tag, such as an enzyme or radiolabel, can then be
contacted with the antibody-antigen complex such that the enzyme or radiolabel
is immobilized on the surface. The detectable tag can then be detected to
determine if an antigen was present in the biological sample. In another
example, a biological sample can be immobilized on a surface. An antibody of
the invention that is coupled to a detectable tag is then contacted with the
immobilized biological sample and any unbound material is washed away. The
presence of the detectable tag is then detected to determine whether the
biological sample contained an antigen. Examples of such assays are well
known in the art and include enzyme-linked immunosorbent assays,
radioimmunoassays, and the like.

Nucleic acid based methods may also be used to diagnose severe acute
respiratory syndrome. In one example, polymerase chain reaction (PCR) may be
used to diagnose SARS-CoV infection. Briefly, a biological sample, such as a
tissue sample, blood, mucus, or saliva, is obtained from an animal. The nucleic
acids within the sample are then extracted using common methods, such as
organic extraction. The extracted nucleic acids are then mixed with forward and
reverse primers that anneal to nucleic acids that encode SARS proteins,
polymerase, nucleotides, and typically a buffer that includes components that
allow the polymerase to extend the forward and reverse primers using the SARS
nucleic acid as a template. The presence of amplified DNA between the forward
and reverse primers is then detected to determine if the sample contained SARS
originated nucleic acid. Nucleic acid hybridization techniques, such as Northern
and Southern blotting, may also be used to detect the presence of SARS nucleic
acids in a biological sample.
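
The logic of primer-based detection can be sketched in a few lines of code.
The example below is only an illustration under stated assumptions: it uses
the first 145 nucleotides of SEQ ID NO: 2 as a template, takes the coding part
of the S1 forward primer (SEQ ID NO: 9) as the forward primer, and pairs it
with a reverse primer that is hypothetical and was chosen for this example
only. Exact string matching stands in for primer annealing, which is a
simplification of real PCR.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq))


def amplicon(template, forward, reverse):
    """Return the region a primer pair would amplify, or None.

    The forward primer must match the template strand exactly and the
    reverse primer must match the opposite strand downstream of it.
    """
    start = template.find(forward)
    if start < 0:
        return None
    site = reverse_complement(reverse)
    hit = template.find(site, start + len(forward))
    if hit < 0:
        return None
    return template[start:hit + len(site)]


# First 145 nucleotides of SEQ ID NO: 2.
TEMPLATE = (
    "ATGTTTATTTTCTTATTATTTCTTACTCTCACTAGTGGTAGTGACCTTG"
    "ACCGGTGCACCACTTTTGATGATGTTCAAGCTCCTAATTACACTCAAC"
    "ATACTTCATCTATGAGGGGGGTTTACTATCCTGATGAAATTTTTAGAT"
)
FORWARD = "GACCGGTGCACCACTTTTG"  # coding part of the S1 forward primer
REVERSE = "TTTCATCAGGATAGTAAAC"  # hypothetical reverse primer

if __name__ == "__main__":
    product = amplicon(TEMPLATE, FORWARD, REVERSE)
    print("product length:", len(product) if product else "no product")
    # A template without the primer sites gives no product.
    print("negative control:", amplicon("A" * 145, FORWARD, REVERSE))
```

A sample is scored positive in this sketch only when both primer sites are
found in the correct order, which mirrors the requirement that amplified DNA
lie between the forward and reverse primers.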

VII. Kits

The invention provides a kit which contains packaging material and an
antibody that binds to an amino acid sequence as set forth in SEQ ID NO: 1, 13,
14, 15, 45, 46, 47, 58, 59, 61, 62, 63 or a fragment of SEQ ID NO: 1, or a
conservative variant thereof. The kit may also contain a syringe to allow for
injection of the antibody contained within the kit into an animal, such as a
human. In another embodiment, the invention provides a kit that may contain
packaging material, and an antibody that binds to an amino acid sequence as set
forth in SEQ ID NO: 1, 13, 14, 15, 20-59, 60, 61, 62, 63 or a fragment of SEQ
ID NO: 1, or a conservative variant thereof that is formulated for administration
to an animal, such as a human. In some embodiments, the antibody binds to an
amino acid sequence set forth in SEQ ID NO:59. In other embodiments, the
antibody binds to an amino acid sequence as set forth in SEQ ID NO:58. Such a
kit may optionally contain a syringe to allow for injection of the antibody
contained within the kit into an animal, such as a human.

The invention also provides a kit which contains packaging material and
a DNA vaccine having a DNA molecule or expression vector encoding a
polypeptide with an amino acid sequence as set forth in SEQ ID NO: 1, 13, 14,
15, 45, 46, 47, 58, 59, 61, 62, 63 or a fragment of SEQ ID NO: 1, or a
conservative variant thereof. The kit may also contain a device for administering
the DNA vaccine (e.g., a syringe or gene gun) to allow for administration of the
vaccine contained within the kit into an animal, such as a human.

The invention also provides a kit which contains packaging material and
a vaccine composition that includes a polypeptide with an amino acid sequence as
set forth in SEQ ID NO: 1, 13, 14, 15, 45, 46, 47, 58, 59, 61, 62, 63 or a
fragment of SEQ ID NO: 1, or a conservative variant thereof. The kit may also
contain a device for administering the vaccine (e.g., a syringe) to allow for
administration of the vaccine contained within the kit into an animal, such as a
human.

The invention also provides a kit for detecting SARS-CoV infection,
which contains packaging material and a polypeptide with an amino acid
sequence as set forth in SEQ ID NO: 1, 13, 14, 15, 45, 46, 47, 58, 59, 61, 62,
63 or a fragment of SEQ ID NO: 1, or a conservative variant thereof. The
polypeptide(s) can be immobilized onto a solid support. Such a kit may be used
for detection of antibodies directed against the SARS-CoV in the serum of
infected animals or humans. The kit can also contain a means for detecting
binding of such antibodies to the S polypeptide(s).

VIII. Amino acid sequence of a full-length spike (S) protein (amino acids 1-
1255) from the Tor2 isolate of the SARS-CoV virus

MFIFLLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSMRGVYYPDEIFRSD 
TLYLTQDLFLPFYSNVTGFHTINHTFGNPVIPFKDGIYFAATEKSNVVRG
WVFGSTMNNKSQSVIIINNSTNVVIRACNFELCDNPFFAVSKPMGTQTHT
MIFDNAFNCTFEYISDAFSLDVSEKSGNFKHLREFVFKNKDGFLYVYKG
YQPIDVVRDLPSGFNTLKPIFKLPLGINITNFRAILTAFSPAQDIWGTSAAA
YFVGYLKPTTFMLKYDENGTITDAVDCSQNPLAELKCSVKSFEIDKGIYQ
TSNFRVVPSGDVVRFPNITNLCPFGEVFNATKFPSVYAWERKKISNCVAD
YSVLYNSTFFSTFKCYGVSATKLNDLCFSNVYADSFVVKGDDVRQIAPG
QTGVIADYNYKLPDDFMGCVLAWNTRNIDATSTGNYNYKYRYLRHGK
LRPFERDISNVPFSPDGKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVV
VLSFELLNAPATVCGPKLSTDLIKNQCVNFNFNGLTGTGVLTPSSKRFQP
FQQFGRDVSDFTDSVRDPKTSEILDISPCAFGGVSVITPGTNASSEVAVLY
QDVNCTDVSTAIHADQLTPAWRIYSTGNNVFQTQAGCLIGAEHVDTSYE
CDIPIGAGICASYHTVSLLRSTSQKSIVAYTMSLGADSSIAYSNNTIAIPTN
FSISITTEVMPVSMAKTSVDCNMYICGDSTECANLLLQYGSFCTQLNRAL
SGIAAEQDRNTREVFAQVKQMYKTPTLKYFGGFNFSQILPDPLKPTKRSF
IEDLLFNKVTLADAGFMKQYGECLGDINARDLICAQKFNGLTVLPPLLT
DDMIAAYTAALVSGTATAGWTFGAGAALQIPFAMQMAYRFNGIGVTQ
NVLYENQKQIANQFNKAISQIQESLTTTSTALGKLQDVVNQNAQALNTL
VKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLI
RAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQAAPHGVV
FLHVTYVPSQERNFTTAPAICHEGKAYFPREGVFVFNGTSWFITQRNFFS
PQIITTDNTFVSGNCDVVIGIINNTVYDPLQPELDSFKEELDKYFKNHTSP
DVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKW
PWYVWLGFIAGLIAIVMVTILLCCMTSCCSCLKGACSCGSCCKFDEDDSE
PVLKGVKLHYT (SEQ ID NO: 1)

IX. Nucleic acid sequence of a full-length spike (S) protein (nucleotides 1-
3768)

ATGTTTATTTTCTTATTATTTCTTACTCTCACTAGTGGTAGTGACCTTG 
ACCGGTGCACCACTTTTGATGATGTTCAAGCTCCTAATTACACTCAAC
ATACTTCATCTATGAGGGGGGTTTACTATCCTGATGAAATTTTTAGAT 
CAGACACTCTTTATTTAACTCAGGATTTATTTCTTCCATTTTATTCTAA 
TGTTACAGGGTTTCATACTATTAATCATACGTTTGGCAACCCTGTCAT 
ACCTTTTAAGGATGGTATTTATTTTGCTGCCACAGAGAAATCAAATGT 
TGTCCGTGGTTGGGTTTTTGGTTCTACCATGAACAACAAGTCACAGTC
GGTGATTATTATTAACAATTCTACTAATGTTGTTATACGAGCATGTAA 
CTTTGAATTGTGTGACAACCCTTTCTTTGCTGTTTCTAAACCCATGGG 
TACACAGACACATACTATGATATTCGATAATGCATTTAATTGCACTTT 
CGAGTACATATCTGATGCCTTTTCGCTTGATGTTTCAGAAAAGTCAGG 
TAATTTTAAACACTTACGAGAGTTTGTGTTTAAAAATAAAGATGGGTT
TCTCTATGTTTATAAGGGCTATCAACCTATAGATGTAGTTCGTGATCT 
ACCTTCTGGTTTTAACACTTTGAAACCTATTTTTAAGTTGCCTCTTGGT 
ATTAACATTACAAATTTTAGAGCCATTCTTACAGCCTTTTCACCTGCT 
CAAGACATTTGGGGCACGTCAGCTGCAGCCTATTTTGTTGGCTATTTA 
AAGCCAACTACATTTATGCTCAAGTATGATGAAAATGGTACAATCAC
AGATGCTGTTGATTGTTCTCAAAATCCACTTGCTGAACTCAAATGCTC 
TGTTAAGAGCTTTGAGATTGACAAAGGAATTTACCAGACCTCTAATTT 
CAGGGTTGTTCCCTCAGGAGATGTTGTGAGATTCCCTAATATTACAAA 
CTTGTGTCCTTTTGGAGAGGTTTTTAATGCTACTAAATTCCCTTCTGTC
TATGCATGGGAGAGAAAAAAAATTTCTAATTGTGTTGCTGATTACTCT
GTGCTCTACAACTCAACATTTTTTTCAACCTTTAAGTGCTATGGCGTT 
TCTGCCACTAAGTTGAATGATCTTTGCTTCTCCAATGTCTATGCAGAT 
TCTTTTGTAGTCAAGGGAGATGATGTAAGACAAATAGCGCCAGGACA 
AACTGGTGTTATTGCTGATTATAATTATAAATTGCCAGATGATTTCAT
GGGTTGTGTCCTTGCTTGGAATACTAGGAACATTGATGCTACTTCAAC 
TGGTAATTATAATTATAAATATAGGTATCTTAGACATGGCAAGCTTA 
GGCCCTTTGAGAGAGACATATCTAATGTGCCTTTCTCCCCTGATGGCA 
AACCTTGCACCCCACCTGCTCTTAATTGTTATTGGCCATTAAATGATT
ATGGTTTTTACACCACTACTGGCATTGGCTACCAACCTTACAGAGTTG
TAGTACTTTCTTTTGAACTTTTAAATGCACCGGCCACGGTTTGTGGAC
CAAAATTATCCACTGACCTTATTAAGAACCAGTGTGTCAATTTTAATT 
TTAATGGACTCACTGGTACTGGTGTGTTAACTCCTTCTTCAAAGAGAT 
TTCAACCATTTCAACAATTTGGCCGTGATGTTTCTGATTTCACTGATT
CCGTTCGAGATCCTAAAACATCTGAAATATTAGACATTTCACCTTGCG
CTTTTGGGGGTGTAAGTGTAATTACACCTGGAACAAATGCTTCATCTG 
AAGTTGCTGTTCTATATCAAGATGTTAACTGCACTGATGTTTCTACAG 
CAATTCATGCAGATCAACTCACACCAGCTTGGCGCATATATTCTACTG 
GAAACAATGTATTCCAGACTCAAGCAGGCTGTCTTATAGGAGCTGAG
CATGTCGACACTTCTTATGAGTGCGACATTCCTATTGGAGCTGGCATT
TGTGCTAGTTACCATACAGTTTCTTTATTACGTAGTACTAGCCAAAAA 
TCTATTGTGGCTTATACTATGTCTTTAGGTGCTGATAGTTCAATTGCTT 
ACTCTAATAACACCATTGCTATACCTACTAACTTTTCAATTAGCATTA 
CTACAGAAGTAATGCCTGTTTCTATGGCTAAAACCTCCGTAGATTGTA
ATATGTACATCTGCGGAGATTCTACTGAATGTGCTAATTTGCTTCTCC
AATATGGTAGCTTTTGCACACAACTAAATCGTGCACTCTCAGGTATTG 
CTGCTGAACAGGATCGCAACACACGTGAAGTGTTCGCTCAAGTCAAA 
CAAATGTACAAAACCCCAACTTTGAAATATTTTGGTGGTTTTAATTTT 
TCACAAATATTACCTGACCCTCTAAAGCCAACTAAGAGGTCTTTTATT
GAGGACTTGCTCTTTAATAAGGTGACACTCGCTGATGCTGGCTTCATG
AAGCAATATGGCGAATGCCTAGGTGATATTAATGCTAGAGATCTCAT 
TTGTGCGCAGAAGTTCAATGGACTTACAGTGTTGCCACCTCTGCTCAC 
TGATGATATGATTGCTGCCTACACTGCTGCTCTAGTTAGTGGTACTGC 
CACTGCTGGATGGACATTTGGTGCTGGCGCTGCTCTTCAAATACCTTT
TGCTATGCAAATGGCATATAGGTTCAATGGCATTGGAGTTACCCAAA
ATGTTCTCTATGAGAACCAAAAACAAATCGCCAACCAATTTAACAAG 
GCGATTAGTCAAATTCAAGAATCACTTACAACAACATCAACTGCATT 
GGGCAAGCTGCAAGACGTTGTTAACCAGAATGCTCAAGCATTAAACA 
CACTTGTTAAACAACTTAGCTCTAATTTTGGTGCAATTTCAAGTGTGC
TAAATGATATCCTTTCGCGACTTGATAAAGTCGAGGCGGAGGTACAA 
ATTGACAGGTTAATTACAGGCAGACTTCAAAGCCTTCAAACCTATGT 
AACACAACAACTAATCAGGGCTGCTGAAATCAGGGCTTCTGCTAATC 
TTGCTGCTACTAAAATGTCTGAGTGTGTTCTTGGACAATCAAAAAGA 
GTTGACTTTTGTGGAAAGGGCTACCACCTTATGTCCTTCCCACAAGCA
GCCCCGCATGGTGTTGTCTTCCTACATGTCACGTATGTGCCATCCCAG 
GAGAGGAACTTCACCACAGCGCCAGCAATTTGTCATGAAGGCAAAGC 
ATACTTCCCTCGTGAAGGTGTTTTTGTGTTTAATGGCACTTCTTGGTTT 
ATTACACAGAGGAACTTCTTTTCTCCACAAATAATTACTACAGACAAT 
ACATTTGTCTCAGGAAATTGTGATGTCGTTATTGGCATCATTAACAAC
ACAGTTTATGATCCTCTGCAACCTGAGCTCGACTCATTCAAAGAAGA 
GCTGGACAAGTACTTCAAAAATCATACATCACCAGATGTTGATCTTG 
GCGACATTTCAGGCATTAACGCTTCTGTCGTCAACATTCAAAAAGAA 
ATTGACCGCCTCAATGAGGTCGCTAAAAATTTAAATGAATCACTCAT 
TGACCTTCAAGAATTGGGAAAATATGAGCAATATATTAAATGGCCTT
GGTATGTTTGGCTCGGCTTCATTGCTGGACTAATTGCCATCGTCATGG 
TTACAATCTTGCTTTGTTGCATGACTAGTTGTTGCAGTTGCCTCAAGG 
GTGCATGCTCTTGTGGTTCTTGCTGCAAGTTTGATGAGGATGACTCTG 
AGCCAGTTCTCAAGGGTGTCAAATTACATTACACATAA (SEQ ID NO: 2)
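
The relationship between the nucleic acid and amino acid listings can be
checked by translating the ends of the coding sequence. The following is a
minimal sketch that assumes the standard genetic code; the helper names are
introduced here for illustration and are not part of the application. It
translates the first 48 nucleotides and the last 12 nucleotides of SEQ ID
NO: 2, and works out the protein length implied by 3768 nucleotides.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code: codons enumerated in TCAG order map onto AMINO_ACIDS.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate the complete codons of a DNA string; '*' marks a stop."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 48 and last 12 nucleotides of SEQ ID NO: 2.
FIRST_48 = "ATGTTTATTTTCTTATTATTTCTTACTCTCACTAGTGGTAGTGACCTT"
LAST_12 = "CATTACACATAA"

if __name__ == "__main__":
    print(translate(FIRST_48))  # amino-terminal residues of the spike protein
    print(translate(LAST_12))   # carboxyl-terminal residues and the stop codon
    print(3768 // 3 - 1)        # number of codons, not counting the stop codon
```

The first translation corresponds to residues 1-16 of SEQ ID NO: 1, and the
arithmetic agrees with the stated length of 1255 amino acids once the
terminal stop codon is set aside.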

Example 1 

Cloning of the spike protein
The nucleic acid sequence encoding the full length spike protein was
obtained through use of overlapping polymerase chain reaction (PCR).
Overlapping clones containing fragments of the spike protein were obtained
from the British Columbia Cancer Agency (Vancouver, British Columbia). The
following primers were used during the PCR reactions to amplify the nucleic
acid sequence encoding the full-length spike protein of SARS-CoV: Clone 1:
Forward primer: 5'-A GTC GGATCC GGT AGG CTT ATC ATT AGA G-3'
(SEQ ID NO: 3); Reverse primer: 5'-CCA TCA GGG GAG AAA GGC AC-3'
(SEQ ID NO: 4). Clone 2: Forward primer: 5'-GTG CCT TTC TCC CCT GAT
GG-3' (SEQ ID NO: 5); Reverse primer: 5'-GAA GAG CAG CGC GAG CAC
C-3' (SEQ ID NO: 6). Clone 3: Forward primer: 5'-GGT GCT GGC GCT GCT
CTT C-3' (SEQ ID NO: 7); Reverse primer: 5'-A CTG TCT AGA GTT CGT
TTA TGT GTA ATG-3' (SEQ ID NO: 8).
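
In overlapping PCR, adjacent fragments are joined through primers that are
reverse complements of one another. The sketch below checks this for the two
junctions, using the primer sequences as they are printed in this text with
the spaces removed; the helper functions are illustrative and are not part of
the application. For the sequences as printed here, the Clone 1/Clone 2
junction is an exact reverse-complement match and the Clone 2/Clone 3 junction
differs at a single position; whether that difference is present in the
original primer or arose in transcription cannot be determined from the text
alone.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq))


def mismatches(a, b):
    """Count differing positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


# Primer sequences as printed in Example 1.
CLONE1_REVERSE = "CCATCAGGGGAGAAAGGCAC"  # SEQ ID NO: 4
CLONE2_FORWARD = "GTGCCTTTCTCCCCTGATGG"  # SEQ ID NO: 5
CLONE2_REVERSE = "GAAGAGCAGCGCGAGCACC"   # SEQ ID NO: 6
CLONE3_FORWARD = "GGTGCTGGCGCTGCTCTTC"   # SEQ ID NO: 7

if __name__ == "__main__":
    junctions = [
        ("Clone 1 / Clone 2", CLONE1_REVERSE, CLONE2_FORWARD),
        ("Clone 2 / Clone 3", CLONE2_REVERSE, CLONE3_FORWARD),
    ]
    for name, reverse_primer, forward_primer in junctions:
        n = mismatches(reverse_primer, reverse_complement(forward_primer))
        print(f"{name}: {n} mismatch(es) between the overlapping primers")
```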

The nucleic acid segment that resulted from overlapping PCR between
the nucleic acid segments generated with the above pairs of primers encodes
amino acid residues 1 to 1255 of the spike protein of the
virus (SARS-CoV) that is etiologically linked to severe acute respiratory
syndrome. The underlined primer sequences represent restriction enzyme
cutting sites for BamHI and XbaI that were used to clone the amplified fragment
into pCDNA3.1(+) (Invitrogen, Carlsbad, California).

The full length spike protein gene has been cloned as shown in Fig. 1.
Fig. 1 shows a gel for the nucleic acid segment encoding the full length spike
protein inserted into the pCDNA3.1(+) vector that has been digested with the
restriction enzymes (Lane 2: BamHI and XbaI; Lane 3: HindIII).

Example 2

Generation of amino-terminal (S1) and carboxyl-terminal (S2) fragments of the
full length spike protein
Computer analysis identified a potential functional separation site
between the amino-terminus (S1) and the carboxyl-terminus (S2) of the spike
protein. The separation site between S1 and S2 lies between positions
758 and 761 (residues 758-RNTR-761) relative to SEQ ID NO: 1. PCR was used to
create nucleic acids that code for the amino-terminal fragment (S1), and the
carboxyl-terminal fragment (S2) of the spike protein.

The following primers, S1 forward primer: 5'-AGTC GGA TCC GAC
CGG TGC ACC ACT TTT G-3' (SEQ ID NO: 9), and S1 reverse primer:
5'-AGTC GGG CCC CTG TTC AGC AGC AAT ACC-3'
(SEQ ID NO: 10), were used to prepare a nucleic acid segment coding for amino
acid residues 17-757 of the spike protein. Two restriction sites, BamHI and
ApaI, underlined in the two primers, were used to clone the nucleic acid segment
coding for the amino-terminal fragment of the spike protein (S1) into the
pSecTag2B plasmid for expression.

The following pair of primers, S2 Forward: 5'-ACTG GGATCC GAA
GTG TTC GCT CAA GTC-3' (SEQ ID NO: 11), and S2 Reverse: 5'-ACTG
TCTAGA TTG CTC ATA TTT TCC C-3' (SEQ ID NO: 12), were used within
a PCR reaction to prepare a nucleic acid segment coding for amino acid residues
762-1189 of the spike protein. Two restriction sites, BamHI and XbaI,
underlined in the two primers, were used to clone the nucleic acid segment
coding for the carboxyl-terminal fragment of the spike protein (S2) into the
pCDNA3.1(+) plasmid for expression.
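
The fragment boundaries implied by these primers can be read off by
translating the part of each primer that follows its restriction site. The
sketch below does this for the S1 and S2 primer pairs under two assumptions:
the standard genetic code applies, and for a reverse primer the first base
after the restriction site is the last base of a codon, so the reading frame
is anchored at the end of its reverse complement. The function names are
illustrative only.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code: codons enumerated in TCAG order map onto AMINO_ACIDS.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq))


def translate(dna):
    """Translate the complete codons of a DNA string."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def coding_part(primer, site):
    """Return the primer sequence that follows the restriction site."""
    return primer[primer.index(site) + len(site):]


def forward_peptide(primer, site):
    """Residues encoded at the start of the amplified fragment."""
    return translate(coding_part(primer, site))


def reverse_peptide(primer, site):
    """Residues encoded at the end of the amplified fragment."""
    sense = reverse_complement(coding_part(primer, site))
    return translate(sense[len(sense) % 3:])  # frame anchored at the 3' end


BAMHI, APAI, XBAI = "GGATCC", "GGGCCC", "TCTAGA"
S1_FORWARD = "AGTCGGATCCGACCGGTGCACCACTTTTG"  # SEQ ID NO: 9
S1_REVERSE = "AGTCGGGCCCCTGTTCAGCAGCAATACC"   # SEQ ID NO: 10
S2_FORWARD = "ACTGGGATCCGAAGTGTTCGCTCAAGTC"   # SEQ ID NO: 11
S2_REVERSE = "ACTGTCTAGATTGCTCATATTTTCCC"     # SEQ ID NO: 12

if __name__ == "__main__":
    print("S1 begins with", forward_peptide(S1_FORWARD, BAMHI))
    print("S1 ends with  ", reverse_peptide(S1_REVERSE, APAI))
    print("S2 begins with", forward_peptide(S2_FORWARD, BAMHI))
    print("S2 ends with  ", reverse_peptide(S2_REVERSE, XBAI))
```

The peptides printed by this sketch can be compared directly with the first
and last residues of the soluble S1 and S2 sequences listed in Example 4.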

To create a fragment containing residues 272-537, the following pair of
primers was used for PCR amplification: primer 5'
GATCGGATCCGGTACAATCACAG 3' (SEQ ID NO:64) and primer 5'
GATCGGGCCCGACACACTGGTTC 3' (SEQ ID NO:65). The amplified
fragment was digested with BamHI and ApaI and ligated into pSecTag2B
digested with the same restriction enzymes. A schematic diagram of the position
of many of the soluble spike protein fragments within the full-length spike
protein is provided in Fig. 1B.

In some cases, nucleic acids encoding the S fragments and full-length S
polypeptides had their native leader sequence (spike protein amino acids 1-16,
MFIFLLFLTLTSGSDL (SEQ ID NO:60)) replaced with a mouse κ chain leader
sequence (METDTLLLWVLLLWVPGSTGD) (SEQ ID NO: 16) to permit
secretion, as described below.
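
The leader replacement and the 1-based residue numbering used throughout
these examples can be illustrated as follows. This is a sketch only: the
function names are introduced here, just the first 50 residues of SEQ ID
NO: 1 are used, and the κ leader is written as METDTLLLWVLLLWVPGSTGD, the form
commonly given for the mouse Ig κ leader; its first three residues should be
treated as an assumption about SEQ ID NO: 16.

```python
NATIVE_LEADER = "MFIFLLFLTLTSGSDL"       # SEQ ID NO: 60, residues 1-16
KAPPA_LEADER = "METDTLLLWVLLLWVPGSTGD"   # assumed reading of SEQ ID NO: 16
# First 50 residues of SEQ ID NO: 1.
SPIKE_START = "MFIFLLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSMRGVYYPDEIFRSD"


def fragment(protein, first, last):
    """Return residues first..last using 1-based, inclusive numbering."""
    return protein[first - 1:last]


def swap_leader(protein, native, replacement):
    """Replace the native leader at the amino terminus with another leader."""
    if not protein.startswith(native):
        raise ValueError("protein does not begin with the native leader")
    return replacement + protein[len(native):]


if __name__ == "__main__":
    print(fragment(SPIKE_START, 1, 16))    # native leader
    print(fragment(SPIKE_START, 17, 22))   # first residues after the leader
    print(swap_leader(SPIKE_START, NATIVE_LEADER, KAPPA_LEADER))
```

Because the numbering is inclusive at both ends, a fragment described as
amino acids 17-757 contains 741 residues, and the residue following the
replaced leader is residue 17 of SEQ ID NO: 1.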



Example 3

Generation of the whole soluble spike protein (sS) lacking the cytoplasmic tail
and the transmembrane domain
The following pair of primers was used to generate a nucleic acid
segment encoding a fragment of the spike protein (sS) lacking the cytoplasmic
tail and having amino acids 17-1189 of SEQ ID NO: 1: S1 Forward: 5'-AGTC
GGATCC GAC CGG TGC ACC ACT TTT G-3' (SEQ ID NO: 9), and Reverse:
5'-ACTG TCTAGA TTG CTC ATA TTT TCC C-3' (SEQ ID NO: 12).

Example 4

Expression of an amino-terminal and carboxyl-terminal fragment of a spike
protein

Expression will be done by transfecting an expression construct
containing the pSecTag2B or pCDNA3.1(+) plasmid and a nucleic acid insert
that encodes an amino-terminal (S1) fragment, a carboxyl-terminal (S2) fragment,
or a fragment of the spike protein of SARS-CoV that lacks the cytoplasmic tail
and the transmembrane domain, into 293 or Vero E6 cells. It is thought that
elimination of the transmembrane domain allows the polypeptides and peptide
fragments to be soluble in an aqueous solution. Expression efficiency of the
encoded fragments will then be tested. Once a positive signal is obtained as
determined with gel analysis, a stably transfected cell line will be generated.
The full length spike protein, and fragments thereof, will be purified according
to methods that are routinely used with other highly glycosylated proteins, such
as use of a lentil lectin column for large-scale production. The resulting
proteins, soluble S1 (sS1), soluble S2 (sS2) and whole soluble S (sS), will have
the following amino acid sequences. Bold lettering denotes the signal peptide,
which can be cleaved so that the secreted protein will not contain it.

Amino acid sequence of a soluble amino-terminal fragment of the spike protein
(amino acids 17-757)

DRCTTFDDVQAPNYTQHTSSMRGVYYPDEIFRSDTLYLTQDLFLPFYSN
VTGFHTINHTFGNPVIPFKDGIYFAATEKSNVVRGWVFGSTMNNKSQSVI
IINNSTNVVIRACNFELCDNPFFAVSKPMGTQTHTMIFDNAFNCTFEYISD
AFSLDVSEKSGNFKHLREFVFKNKDGFLYVYKGYQPIDVVRDLPSGFNT
LKPIFKLPLGINITNFRAILTAFSPAQDIWGTSAAAYFVGYLKPTTFMLKY
DENGTITDAVDCSQNPLAELKCSVKSFEIDKGIYQTSNFRVVPSGDVVRF
PNITNLCPFGEVFNATKFPSVYAWERKKISNCVADYSVLYNSTFFSTFKC
YGVSATKLNDLCFSNVYADSFVVKGDDVRQIAPGQTGVIADYNYKLPD
DFMGCVLAWNTRNIDATSTGNYNYKYRYLRHGKLRPFERDISNVPFSPD
GKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVVVLSFELLNAPATVCG
PKLSTDLIKNQCVNFNFNGLTGTGVLTPSSKRFQPFQQFGRDVSDFTDSV
RDPKTSEILDISPCAFGGVSVITPGTNASSEVAVLYQDVNCTDVSTAIHAD
QLTPAWRIYSTGNNVFQTQAGCLIGAEHVDTSYECDIPIGAGICASYHTV
SLLRSTSQKSIVAYTMSLGADSSIAYSNNTIAIPTNFSISITTEVMPVSMAK
TSVDCNMYICGDSTECANLLLQYGSFCTQLNRALSGIAAEQ (SEQ ID
NO: 13)

Amino acid sequence of a soluble carboxyl-terminal fragment of the spike
protein (amino acids 762-1189)

EVFAQVKQMYKTPTLKYFGGFNFSQILPDPLKPTKRSFIEDLLFNKVTLA
DAGFMKQYGECLGDINARDLICAQKFNGLTVLPPLLTDDMIAAYTAALV
SGTATAGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKQIANQ
FNKAISQIQESLTTTSTALGKLQDVVNQNAQALNTLVKQLSSNFGAISSV
LNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAAT
KMSECVLGQSKRVDFCGKGYHLMSFPQAAPHGVVFLHVTYVPSQERNF
TTAPAICHEGKAYFPREGVFVFNGTSWFITQRNFFSPQIITTDNTFVSGNC
DVVIGIINNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVV
NIQKEIDRLNEVAKNLNESLIDLQELGKYEQ (SEQ ID NO: 14)

Amino acid sequence of a soluble spike protein having amino acids 17-757 and
762-1189 of SEQ ID NO: 1 (lacking the signal peptide and the potential
cleavage site)

DRCTTFDDVQAPNYTQHTSSMRGVYYPDEIFRSDTLYLTQDLFLPFYSN
VTGFHTINHTFGNPVIPFKDGIYFAATEKSNVVRGWVFGSTMNNKSQSVI
IINNSTNVVIRACNFELCDNPFFAVSKPMGTQTHTMIFDNAFNCTFEYISD
AFSLDVSEKSGNFKHLREFVFKNKDGFLYVYKGYQPIDVVRDLPSGFNT
LKPIFKLPLGINITNFRAILTAFSPAQDIWGTSAAAYFVGYLKPTTFMLKY
DENGTITDAVDCSQNPLAELKCSVKSFEIDKGIYQTSNFRVVPSGDVVRF
PNITNLCPFGEVFNATKFPSVYAWERKKISNCVADYSVLYNSTFFSTFKC
YGVSATKLNDLCFSNVYADSFVVKGDDVRQIAPGQTGVIADYNYKLPD
DFMGCVLAWNTRNIDATSTGNYNYKYRYLRHGKLRPFERDISNVPFSPD
GKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVVVLSFELLNAPATVCG
PKLSTDLIKNQCVNFNFNGLTGTGVLTPSSKRFQPFQQFGRDVSDFTDSV
RDPKTSEILDISPCAFGGVSVITPGTNASSEVAVLYQDVNCTDVSTAIHAD
QLTPAWRIYSTGNNVFQTQAGCLIGAEHVDTSYECDIPIGAGICASYHTV
SLLRSTSQKSIVAYTMSLGADSSIAYSNNTIAIPTNFSISITTEVMPVSMAK
TSVDCNMYICGDSTECANLLLQYGSFCTQLNRALSGIAAEQDEVFAQVK
QMYKTPTLKYFGGFNFSQILPDPLKPTKRSFIEDLLFNKVTLADAGFMKQ
YGECLGDINARDLICAQKFNGLTVLPPLLTDDMIAAYTAALVSGTATAG
WTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKQIANQFNKAISQI
QESLTTTSTALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRL
DKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVL
GQSKRVDFCGKGYHLMSFPQAAPHGVVFLHVTYVPSQERNFTTAPAICH
EGKAYFPREGVFVFNGTSWFITQRNFFSPQIITTDNTFVSGNCDVVIGIINN
TVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRL
NEVAKNLNESLIDLQELGKYEQ (SEQ ID NO: 15)


Example 5

Generation of additional soluble fragments of the spike protein
The nucleic acid sequence encoding a polypeptide containing amino
acids 17-757 of SEQ ID NO: 1 was obtained through use of polymerase chain
reaction (PCR). The following primers were used during the PCR reactions to
amplify the nucleic acid sequence: Forward primer: 5' AGCT GGA TCC GAC
CGG TGC ACC ACT TTT G 3' (SEQ ID NO: 9); and Reverse primer: 5'
AGCT GGGCCC CTG TTC AGC AGC AAT ACC 3' (SEQ ID NO: 10). The
resulting PCR product was digested with BamHI and ApaI and encodes a
polypeptide having an amino acid sequence corresponding to SEQ ID NO: 43.
The digested PCR product was then ligated to pSecTag2B (Invitrogen, Carlsbad,
California) that was digested with the same enzymes. The pSecTag2B construct
containing the PCR product insert encodes a polypeptide having SEQ ID NO: 46
with the mouse κ chain leader sequence (METDTLLLWVLLLWVPGSTGD)
(SEQ ID NO: 16) at the N-terminus for secretion, and a myc epitope
(EQKLISEEDL) (SEQ ID NO: 17) plus a histidine tag (HHHHHH) (SEQ ID
NO: 18) at the C-terminus for affinity purification.

The nucleic acid sequence encoding a polypeptide containing amino
acids 17-276 of SEQ ID NO: 1 was obtained through use of polymerase chain
reaction (PCR). The following primers were used during the PCR reactions to
amplify the nucleic acid sequence: Forward primer: 5' AGCT GGA TCC GAC
CGG TGC ACC ACT TTT G 3' (SEQ ID NO: 9); and Reverse primer: 5'
CTAG CTC GAG CAA CAG CAT CTG TG 3' (SEQ ID NO: 19). The
resulting PCR product was digested with BamHI and XhoI and encodes a
polypeptide having SEQ ID NO: 44. The digested PCR product was then ligated
to pSecTag2B (Invitrogen, Carlsbad, California) that was digested with the same
enzymes. The pSecTag2B construct containing the PCR product insert encodes
a polypeptide having SEQ ID NO: 47 with the mouse κ chain leader sequence
(METDTLLLWVLLLWVPGSTGD) (SEQ ID NO: 16) at the N-terminus for
secretion, and a myc epitope (EQKLISEEDL) (SEQ ID NO: 17) plus a histidine
tag (HHHHHH) (SEQ ID NO: 18) at the C-terminus for affinity purification.
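
The cloning sites carried by the primers described above can be checked with a short script. The following Python sketch is illustrative only and is not part of the described method. It assumes the standard recognition sequences for BamHI (GGATCC), ApaI (GGGCCC) and XhoI (CTCGAG); the helper names `clean` and `sites_in` are hypothetical.

```python
# Standard recognition sequences for the enzymes named in this example.
SITES = {
    "BamHI": "GGATCC",
    "ApaI": "GGGCCC",
    "XhoI": "CTCGAG",
}


def clean(primer):
    """Keep only A, C, G and T so that spaced triplets become one string."""
    return "".join(ch for ch in primer.upper() if ch in "ACGT")


def sites_in(primer):
    """Return the names of the enzymes whose site occurs in the primer."""
    seq = clean(primer)
    return [name for name, site in SITES.items() if site in seq]


# Primers as written in the text (5' to 3').
forward = "AGCT GGA TCC GAC CGG TGC ACC ACT TTT G"    # SEQ ID NO: 9
reverse_757 = "AGCT GGGCCC CTG TTC AGC AGC AAT ACC"   # SEQ ID NO: 10
reverse_276 = "CTAG CTC GAG CAA CAG CAT CTG TG"       # SEQ ID NO: 19

for label, primer in [
    ("forward", forward),
    ("reverse (17-757)", reverse_757),
    ("reverse (17-276)", reverse_276),
]:
    print(label, sites_in(primer))
```

Each primer should report exactly the enzyme used to digest the corresponding end of the PCR product, which is what allows directional ligation into the vector cut with the same pair of enzymes.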

The nucleic acid sequence encoding a polypeptide containing amino
acids 17-537 of SEQ ID NO: 1 was obtained by digesting the nucleic acid
sequence that encodes SEQ ID NO: 43 (as described above) with BamHI and
HincII. The nucleic acid segment produced encodes a polypeptide having SEQ
ID NO: 45. This nucleic acid segment was ligated into a pSecTag2B vector that
was digested with BamHI and EcoRV. The pSecTag2B construct containing the
PCR product insert encodes a polypeptide having SEQ ID NO: 48 with the
mouse κ chain leader sequence (METDTLLLWVLLLWVPGSTGD) (SEQ ID
NO: 16) at the N-terminus for secretion, and a myc epitope (EQKLISEEDL)
(SEQ ID NO: 17) plus a histidine tag (HHHHHH) (SEQ ID NO: 18) at the
C-terminus for affinity purification.
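
The layout of the tagged polypeptides described in this example (leader, spike fragment, myc epitope, histidine tag) can be sketched as a simple concatenation. The Python below is a hedged illustration: the three tag sequences are taken from the text, but the function names are hypothetical, the short fragment used in the demonstration is only the first ten residues of the 17-757 fragment, and any vector-derived linker residues are ignored by assumption.

```python
LEADER = "METDTLLLWVLLLWVPGSTGD"  # mouse kappa chain leader, SEQ ID NO: 16
MYC = "EQKLISEEDL"                # myc epitope, SEQ ID NO: 17
HIS6 = "HHHHHH"                   # histidine tag, SEQ ID NO: 18


def tagged_construct(fragment):
    """N-terminal leader + spike fragment + C-terminal myc epitope and His tag."""
    return LEADER + fragment + MYC + HIS6


def secreted_form(construct):
    """Remove the leader, assuming it is cleaved off during secretion."""
    if not construct.startswith(LEADER):
        raise ValueError("construct does not start with the leader sequence")
    return construct[len(LEADER):]


# Demonstration with a truncated fragment (first ten residues after the
# native signal peptide); a real construct would carry the full fragment.
demo_fragment = "DRCTTFDDVQ"
construct = tagged_construct(demo_fragment)
print(construct)
print(secreted_form(construct))
```

The start of the demonstration construct, METDTLLLWVLLLWVPGSTGDDRCTTF, matches the start of the tagged sequences listed in Table 1, and the C-terminus ends in EQKLISEEDLHHHHHH as described.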

The expression of these peptide fragments in mammalian cells is
illustrated in Fig. 3. This figure shows that the peptide fragments can be
secreted into medium in which cells that express the peptide fragments are
grown. Fig. 3 also indicates that the peptide fragments are soluble in aqueous
medium.

Table 1

Examples of additional peptide fragments of the invention 



SEQ ID
NUMBER


Amino Acid 
Position 
Relative to SEQ 
ID NO: 1 


Amino Acid Sequence 

Amino acid residues in bold (amino acid 
residues 1-16) identify a signal sequence. 


20 


1-100 


MFIFLLFLTLTSGSDLDRCTTFDDVQAP 
NYTQHTSSMRGVYYPDEIFRSDTLYLTQ 
DLFLPFYSNVTGFHTINHTFGNPVIPFKD
GIYFAATEKSNVVRG 


21 


101-200 


WVFGSTMNNKSQSVIIINNSTNVVIRACN
FELCDNPFFAVSKPMGTQTHTMIFDNAF
NCTFEYISDAFSLDVSEKSGNFKHLREFV
FKNKDGFLYVYKGY


22 


201-300 


QPIDVVRDLPSGFNTLKPIFKLPLGINITN
FRAILTAFSPAQDIWGTSAAAYFVGYLK
PTTFMLKYDENGTITDAVDCSQNPLAEL

KCSVKSFEIDKGIY 


23 


301-400 


QTSNFRVVPSGDVVRFPNITNLCPFGEVF
NATKFPSVYAWERKKISNCVADYSVLY
NSTFFSTFKCYGVSATKLNDLCFSNVYA
DSFVVKGDDVRQIAPG


24 


401-500 


QTGVIADYNYKLPDDFMGCVLAWNTRN
IDATSTGNYNYKYRYLRHGKLRPFERDI
SNVPFSPDGKPCTPPALNCYWPLNDYGF
YTTTGIGYQPYRVVVLS


25 


501-600 


FELLNAPATVCGPKLSTDLIKNQCVNFN
FNGLTGTGVLTPSSKRFQPFQQFGRDVS
DFTDSVRDPKTSEILDISPCAFGGVSVITP
GTNASSEVAVLYQD


26 


601-700 


VNCTDVSTAIHADQLTPAWRIYSTGNNV 
FQTQAGCLIGAEHVDTSYECDIPIGAGIC 
ASYHTVSLLRSTSQKSIVAYTMSLGADS 
SIAYSNNTIAIPTNF 


27 


701-800 


SISITTEVMPVSMAKTSVDCNMYICGDST 
ECANLLLQYGSFCTQLNRALSGIAAEQD 
RNTREVFAQVKQMYKTPTLKYFGGFNF 
SQILPDPLKPTKRSFI 


28 


801-900 


EDLLFNKVTLADAGFMKQYGECLGDIN 
ARDLICAQKFNGLTVLPPLLTDDMIAAY 
TAALVSGTATAGWTFGAGAALQIPFAM 
QMAYRFNGIGVTQNVLYE


29 


901-1000 


NQKQIANQFNKAISQIQESLTTTSTALGK 
LQDVVNQNAQALNTLVKQLSSNFGAISS 
VLNDILSRLDKVEAEVQIDRLITGRLQSL 
QTYVTQQLIRAAEI 


30 


1001-1100 


RASANLAATKMSECVLGQSKRVDFCGK 
GYHLMSFPQAAPHGVVFLHVTYVPSQE 
RNFTTAPAICHEGKAYFPREGVFVFNGT 
SWFITQRNFFSPQIITTD


31 


1101-1189 


NTFVSGNCDWIGHNNTVYDPLQPELDS 

FKEELDKYFKNHTSPDVDLGDISGINASV 

VNIQKEIDRLNEVAKNLNESLIDLQELGK 

YEQ 


32 


1-200 


MFIFLLFLTLTSGSDLDRCTTFDDVQAP 

NYTQHTSSMRGVYYPDEIFRSDTLYLTQ 

DLFLPFYSNVTGFHTINHTFGNPVIPFKD 

GIYFAATEKSNVVRGWVFGSTMNNKSQ 

SVIIINNSTNVVIRACNFELCDNPFFAVSK

PMGTQTHTMIFDNAFNCTFEYISDAFSLD

VSEKSGNFKHLREFVFKNKDGFLYVYK 

GY 


33 


201-400 


QPIDVVRDLPSGFNTLKPIFKLPLGINITN 

FRAILTAFSPAQDIWGTSAAAYFVGYLK 

PTTFMLKYDENGTITDAVDCSQNPLAEL 

KCSVKSFEIDKGIYQTSNFRVVPSGDWR 

FPNITNLCPFGEVFNATKFPSVYAWERK 

KISNCVADYSVLYNSTFFSTFKCYGVSA 

TKLNDLCFSNVYADSFWKGDDVRQIAP 

G 


34 


401-600 


QTGVIADYNYKLPDDFMGCVLAWNTRN 

IDATSTGNYNYKYRYLRHGKLRPFERDI 

SNVPFSPDGKPCTPPALNCYWPLNDYGF 

YTTTGIGYQPYRVVVLSFELLNAPATVC 

GPKLSTDLIKNQCVNFNFNGLTGTGVLT

PSSKRFQPFQQFGRDVSDFTDSVRDPKTS 

EILDISPCAFGGVSVITPGTNASSEVAVLY 

QD 





35 


601-800 


VNCTDVSTAIHADQLTPAWRIYSTGNNV 

FQTQAGCLIGAEHVDTSYECDIPIGAGIC 

ASYHTVSLLRSTSQKSIVAYTMSLGADS 

SIAYSNNTIAIPTNFSISITTEVMPVSMAK 

TSVDCNMYICGDSTECANLLLQYGSFCT 

QLNRALSGIAAEQDRNTREVFAQVKQM 

YKTPTLKYFGGFNFSQILPDPLKPTKRSFI 


36 


801-1000 


EDLLFNKVTLADAGFMKQYGECLGDIN 

ARDLICAQKFNGLTVLPPLLTDDMIAAY 

TAALVSGTATAGWTFGAGAALQIPFAM 

QMAYRFNGIGVTQNVLYENQKQIANQF 

NKAISQIQESLTTTSTALGKLQDVVNQN

AQALNTLVKQLSSNFGAISSVLNDILSRL

DKVEAEVQIDRLITGRLQSLQTYVTQQLI

RAAEI 


37 


1001-1189 


RASANLAATKMSECVLGQSKRVDFCGK 

GYHLMSFPQAAPHGVVFLHVTYVPSQE 

RNFTTAPAICHEGKAYFPREGVFVFNGT 

SWFITQRNFFSPQIITTDNTFVSGNCDVVI

GIINNTVYDPLQPELDSFKEELDKYFKNH 

TSPDVDLGDISGINASVVNIQKEIDRLNE 

VAKNLNESLIDLQELGKYEQ 


38 


1-400 


MFIFLLFLTLTSGSDLDRCTTFDDVQAP 
NYTQHTSSMRGVYYPDEIFRSDTLYLTQ 

DLFLPFYSNVTGFHTINHTFGNPVIPFKD 

GIYFAATEKSNVVRGWVFGSTMNNKSQ

SVIIINNSTNVVIRACNFELCDNPFFAVSK

PMGTQTHTMIFDNAFNCTFEYISDAFSLD

VSEKSGNFKHLREFVFKNKDGFLYVYK

GYQPIDVVRDLPSGFNTLKPIFKLPLGINI

TNFRAILTAFSPAQDIWGTSAAAYFVGY

LKPTTFMLKYDENGTITDAVDCSQNPLA

ELKCSVKSFEIDKGIYQTSNFRVVPSGDV

VRFPNITNLCPFGEVFNATKFPSVYAWE

RKKISNCVADYSVLYNSTFFSTFKCYGV

SATKLNDLCFSNVYADSFVVKGDDVRQI

APG 





39 


1-600 


MFIFLLFLTLTSGSDLDRCTTFDDVQAP 

NYTQHTSSMRGVYYPDEIFRSDTLYLTQ 

DLFLPFYSNVTGFHTINHTFGNPVIPFKD 

GIYFAATEKSNVVRGWVFGSTMNNKSQ

SVIIINNSTNVVIRACNFELCDNPFFAVSK

PMGTQTHTMIFDNAFNCTFEYISDAFSLD

VSEKSGNFKHLREFVFKNKDGFLYVYK

GYQPIDVVRDLPSGFNTLKPIFKLPLGINI

TNFRAILTAFSPAQDIWGTSAAAYFVGY

LKPTTFMLKYDENGTITDAVDCSQNPLA

ELKCSVKSFEIDKGIYQTSNFRVVPSGDV

VRFPNITNLCPFGEVFNATKFPSVYAWE

RKKISNCVADYSVLYNSTFFSTFKCYGV

SATKLNDLCFSNVYADSFVVKGDDVRQI

APGQTGVIADYNYKLPDDFMGCVLAWN

TRNIDATSTGNYNYKYRYLRHGKLRPFE

RDISNVPFSPDGKPCTPPALNCYWPLND

YGFYTTTGIGYQPYRVVVLSFELLNAPA

TVCGPKLSTDLIKNQCVNFNFNGLTGTG
VLTPSSKRFQPFQQFGRDVSDFTDSVRDP
KTSEILDISPCAFGGVSVITPGTNASSEVA
VLYQD


40


1-800


MFIFLLFLTLTSGSDLDRCTTFDDVQAP 

NYTQHTSSMRGVYYPDEIFRSDTLYLTQ 

DLFLPFYSNVTGFHTINHTFGNPVIPFKD 

GIYFAATEKSNVVRGWVFGSTMNNKSQ 

SVIIINNSTNVVIRACNFELCDNPFFAVSK

PMGTQTHTMIFDNAFNCTFEYISDAFSLD 

VSEKSGNFKHLREFVFKNKDGFLYVYK 

GYQPIDWRDLPSGFNTLKPIFKLPLGINI 

TNFRAILTAFSPAQDIWGTSAAAYFVGY 

LKPTTFMLKYDENGTITDAVDCSQNPLA 

ELKCSVKSFEIDKGIYQTSNFRVVPSGDV 

VRFPNITNLCPFGEVFNATKFPSVYAWE 

RKKISNCVADYSVLYNSTFFSTFKCYGV 

SATKLNDLCFSNVYADSFWKGDDVRQI 

APGQTGVIADYNYKLPDDFMGCVLAWN 

TRNIDATSTGNYNYKYRYLRHGKLRPFE 

RDISNVPFSPDGKPCTPPALNCYWPLND 

YGFYTTTGIGYQPYRWVLSFELLNAPA 

TVCGPKLSTDLIKNQCVNFNFNGLTGTG 

VLTPSSKRFQPFQQFGRDVSDFTDSVRDP 

KTSEILDISPCAFGGVSVITPGTNASSEVA 

VLYQDVNCTDVSTAIHADQLTPAWRIYS 

TGNNVFQTQAGCLIGAEHVDTSYECDIPI 

GAGICASYHTVSLLRSTSQKSIVAYTMSL 

GADSSIAYSNNTIAIPTNFSISITTEVMPVS

MAKTSVDCNMYICGDSTECANLLLQYG 
SFCTQLNRALSGIAAEQDRNTREVFAQV 
KQMYKTPTLKYFGGFNFSQILPDPLKPT 
KRSFI


41 


1-1000 


MFIFLLFLTLTSGSDLDRCTTFDDVQAP 

NYTQHTSSMRGVYYPDEIFRSDTLYLTQ 

DLFLPFYSNVTGFHTINHTFGNPVIPFKD 

GIYFAATEKSNVVRGWVFGSTMNNKSQ

SVIIINNSTNVVIRACNFELCDNPFFAVSK

PMGTQTHTMIFDNAFNCTFEYISDAFSLD

VSEKSGNFKHLREFVFKNKDGFLYVYK

GYQPIDVVRDLPSGFNTLKPIFKLPLGINI

TNFRAILTAFSPAQDIWGTSAAAYFVGY 

LKPTTFMLKYDENGTITDAVDCSQNPLA 

ELKCSVKSFEIDKGIYQTSNFRVVPSGDV 

VRFPNITNLCPFGEVFNATKFPSVYAWE 

RKKISNCVADYSVLYNSTFFSTFKCYGV 

SATKLNDLCFSNVYADSFWKGDDVRQI 

APGQTGVIADYNYKLPDDFMGCVLAWN 

TRNIDATSTGNYNYKYRYLRHGKLRPFE 

RDISNVPFSPDGKPCTPPALNCYWPLND 

YGFYTTTGIGYQPYRVVVLSFELLNAPA 

TVCGPKLSTDLIKNQCVNFNFNGLTGTG 

VLTPSSKRFQPFQQFGRDVSDFTDSVRDP 

KTSEILDISPCAFGGVSVITPGTNASSEVA 

VLYQDVNCTDVSTAIHADQLTPAWRIYS 

TGNNVFQTQAGCLIGAEHVDTSYECDIPI 

GAGICASYHTVSLLRSTSQKSIVAYTMSL 

GADSSIAYSNNTIAIPTNFSISITTEVMPVS 

MAKTSVDCNMYICGDSTECANLLLQYG

SFCTQLNRALSGIAAEQDRNTREVFAQV 

KQMYKTPTLKYFGGFNFSQILPDPLKPT 

KRSFIEDLLFNKVTLADAGFMKQYGECL 

GDINARDLICAQKFNGLTVLPPLLTDDMI 

AAYTAALVSGTATAGWTFGAGAALQIP 

FAMQMAYRFNGIGVTQNVLYENQKQIA

NQFNKAISQIQESLTTTSTALGKLQDVVN 
QNAQALNTLVKQLSSNFGAISSVLNDILS 
RLDKVEAEVQIDRLITGRLQSLQTYVTQ 
QLIRAAEI


42 


1-1189 


MFIFLLFLTLTSGSDLDRCTTFDDVQAP

NYTQHTSSMRGVYYPDEIFRSDTLYLTQ

DLFLPFYSNVTGFHTINHTFGNPVIPFKD

GIYFAATEKSNVVRGWVFGSTMNNKSQ

SVIIINNSTNVVIRACNFELCDNPFFAVSK

PMGTQTHTMIFDNAFNCTFEYISDAFSLD 

VSEKSGNFKHLREFVFKNKDGFLYVYK 

GYQPIDWRDLPSGFNTLKPIFKLPLGINI 

TNFRAILTAFSPAQDIWGTSAAAYFVGY 

LKPTTFMLKYDENGTITDAVDCSQNPLA 

ELKCSVKSFEIDKGIYQTSNFRVVPSGDV 

VRFPNITNLCPFGEVFNATKFPSVYAWE 

RKKISNCVADYSVLYNSTFFSTFKCYGV 

SATKLNDLCFSNVYADSFWKGDDVRQI 

APGQTGVIADYNYKLPDDFMGCVLAWN 

TRNIDATSTGNYNYKYRYLRHGKLRPFE 

RDISNVPFSPDGKPCTPPALNCYWPLND 

YGFYTTTGIGYQPYRWVLSFELLNAPA 

TVCGPKLSTDLIKNQCVNFNFNGLTGTG 

VLTPSSKRFQPFQQFGRDVSDFTDSVRDP 

KTSEILDISPCAFGGVSVITPGTNASSEVA 

VLYQDVNCTDVSTAIHADQLTPAWRIYS 

TGNNVFQTQAGCLIGAEHVDTSYECDIPI 

GAGICASYHTVSLLRSTSQKSIVAYTMSL 

GADSSIAYSNNTIAIPTNFSISITTEVMPVS 

MAKTSVDCNMYICGDSTECANLLLQYG 

SFCTQLNRALSGIAAEQDRNTREVFAQV 

KQMYKTPTLKYFGGFNFSQILPDPLKPT 

KRSFIEDLLFNKVTLADAGFMKQYGECL 

GDINARDLICAQKFNGLTVLPPLLTDDMI 

AAYTAALVSGTATAGWTFGAGAALQIP 

FAMQMAYRFNGIGVTQNVLYENQKQIA 

NQFNKAISQIQESLTTTSTALGKLQDVVN 

QNAQALNTLVKQLSSNFGAISSVLNDILS 

RLDKVEAEVQIDRLITGRLQSLQTYVTQ 

QLIRAAEIRASANLAATKMSECVLGQSK 

RVDFCGKGYHLMSFPQAAPHGVVFLHV

TYVPSQERNFTTAPAICHEGKAYFPREG

VFVFNGTSWFITQRNFFSPQIITTDNTFVS

GNCDVVIGIINNTVYDPLQPELDSFKEEL

DKYFKNHTSPDVDLGDISGINASVVNIQ

KEIDRLNEVAKNLNESLIDLQELGKYEQ 


43 


17-100 


DRCTTFDDVQAPNYTQHTSSMRGVYYP 

DEIFRSDTLYLTQDLFLPFYSNVTGFHTI 

NHTFGNPVIPFKDGIYFAATEKSNVVRG


44 


17-200 


DRCTTFDDVQAPNYTQHTSSMRGVYYP 

DEIFRSDTLYLTQDLFLPFYSNVTGFHTI 

NHTFGNPVIPFKDGIYFAATEKSNVVRG 

WVFGSTMNNKSQSVIIINNSTNVVIRACN

FELCDNPFFAVSKPMGTQTHTMIFDNAF 

NCTFEYISDAFSLDVSEKSGNFKHLREFV 

FKNKDGFLYVYKGY


45


17-400


DRCTTFDDVQAPNYTQHTSSMRGVYYP 

DEIFRSDTLYLTQDLFLPFYSNVTGFHTI 

NHTFGNPVIPFKDGIYFAATEKSNVVRG 

WVFGSTMNNKSQSVIIINNSTNVVIRACN

FELCDNPFFAVSKPMGTQTHTMIFDNAF

NCTFEYISDAFSLDVSEKSGNFKHLREFV

FKNKDGFLYVYKGYQPIDVVRDLPSGFN

TLKPIFKLPLGINITNFRAILTAFSPAQDIW

GTSAAAYFVGYLKPTTFMLKYDENGTIT

DAVDCSQNPLAELKCSVKSFEIDKGIYQ

TSNFRVVPSGDVVRFPNITNLCPFGEVFN

ATKFPSVYAWERKKISNCVADYSVLYNS

TFFSTFKCYGVSATKLNDLCFSNVYADS

FVVKGDDVRQIAPG


46


17-600


DRCTTFDDVQAPNYTQHTSSMRGVYYP 

DEIFRSDTLYLTQDLFLPFYSNVTGFHTI 

NHTFGNPVIPFKDGIYFAATEKSNVVRG

WVFGSTMNNKSQSVIIINNSTNVVIRACN

FELCDNPFFAVSKPMGTQTHTMIFDNAF

NCTFEYISDAFSLDVSEKSGNFKHLREFV

FKNKDGFLYVYKGYQPIDVVRDLPSGFN

TLKPIFKLPLGINITNFRAILTAFSPAQDIW

GTSAAAYFVGYLKPTTFMLKYDENGTIT

DAVDCSQNPLAELKCSVKSFEIDKGIYQ

TSNFRVVPSGDVVRFPNITNLCPFGEVFN

ATKFPSVYAWERKKISNCVADYSVLYNS

TFFSTFKCYGVSATKLNDLCFSNVYADS

FVVKGDDVRQIAPGQTGVIADYNYKLPD

DFMGCVLAWNTRNIDATSTGNYNYKYR

YLRHGKLRPFERDISNVPFSPDGKPCTPP

ALNCYWPLNDYGFYTTTGIGYQPYRVV

VLSFELLNAPATVCGPKLSTDLIKNQCV
NFNFNGLTGTGVLTPSSKRFQPFQQFGR 
DVSDFTDSVRDPKTSEILDISPCAFGGVS 
VITPGTNASSEVAVLYQD


47


17-800


DRCTTFDDVQAPNYTQHTSSMRGVYYP 

DEIFRSDTLYLTQDLFLPFYSNVTGFHTI 

NHTFGNPVIPFKDGIYFAATEKSNVVRG 

WVFGSTMNNKSQSVIIINNSTNVVIRACN

FELCDNPFFAVSKPMGTQTHTMIFDNAF 

NCTFEYISDAFSLDVSEKSGNFKHLREFV 

FKNKDGFLYVYKGYQPIDVVRDLPSGFN 

TLKPIFKLPLGINITNFRAILTAFSPAQDIW 

GTSAAAYFVGYLKPTTFMLKYDENGTIT 

DAVDCSQNPLAELKCSVKSFEIDKGIYQ 

TSNFRVVPSGDVVRFPNITNLCPFGEVFN

ATKFPSVYAWERKKISNCVADYSVLYNS

TFFSTFKCYGVSATKLNDLCFSNVYADS

FVVKGDDVRQIAPGQTGVIADYNYKLPD

DFMGCVLAWNTRNIDATSTGNYNYKYR

YLRHGKLRPFERDISNVPFSPDGKPCTPP

ALNCYWPLNDYGFYTTTGIGYQPYRVV

VLSFELLNAPATVCGPKLSTDLIKNQCV 

NFNFNGLTGTGVLTPSSKRFQPFQQFGR 

DVSDFTDSVRDPKTSEILDISPCAFGGVS 

VITPGTNASSEVAVLYQDVNCTDVSTAI 

HADQLTPAWRIYSTGNNVFQTQAGCLIG 

AEHVDTSYECDIPIGAGICASYHTVSLLR 

STSQKSIVAYTMSLGADSSIAYSNNTIAIP

TNFSISITTEVMPVSMAKTSVDCNMYICG 

DSTECANLLLQYGSFCTQLNRALSGIAA 

EQDRNTREVFAQVKQMYKTPTLKYFGG 

FNFSQILPDPLKPTKRSFI





48 


17-1000 


DRCTTFDDVQAPNYTQHTSSMRGVYYP 

DEIFRSDTLYLTQDLFLPFYSNVTGFHTI 

NHTFGNPVIPFKDGIYFAATEKSNVVRG

WVFGSTMNNKSQSVIIINNSTNVVIRACN

FELCDNPFFAVSKPMGTQTHTMIFDNAF 

NCTFEYISDAFSLDVSEKSGNFKHLREFV 

FKNKDGFLYVYKGYQPIDWRDLPSGFN 

TLKPIFKLPLGINITNFRAILTAFSPAQDIW 

GTSAAAYFVGYLKPTTFMLKYDENGTIT 

DAVDCSQNPLAELKCSVKSFEIDKGIYQ 

TSNFRVVPSGDVVRFPNITNLCPFGEVFN 

ATKFPSVYAWERKKISNCVADYSVLYNS 

TFFSTFKCYGVSATKLNDLCFSNVYADS 

FVVKGDDVRQIAPGQTGVIADYNYKLPD 

DFMGCVLAWNTRNIDATSTGNYNYKYR 

YLRHGKLRPFERDISNVPFSPDGKPCTPP 

ALNCYWPLNDYGFYTTTGIGYQPYRW 

VLSFELLNAPATVCGPKLSTDLIKNQCV 

NFNFNGLTGTGVLTPSSKRFQPFQQFGR 

DVSDFTDSVRDPKTSEILDISPCAFGGVS 

VITPGTNASSEVAVLYQDVNCTDVSTAI 

HADQLTPAWRIYSTGNNVFQTQAGCLIG 

AEHVDTSYECDIPIGAGICASYHTVSLLR 

STSQKSIVAYTMSLGADSSIAYSNNTIAIP 

TNFSISITTEVMPVSMAKTSVDCNMYICG 

DSTECANLLLQYGSFCTQLNRALSGIAA 

EQDRNTREVFAQVKQMYKTPTLKYFGG 

FNFSQILPDPLKPTKRSFIEDLLFNKVTLA 

DAGFMKQYGECLGDINARDLICAQKFN 

GLTVLPPLLTDDMIAAYTAALVSGTATA 

GWTFGAGAALQIPFAMQMAYRFNGIGV

TQNVLYENQKQIANQFNKAISQIQESLTT 
TSTALGKLQDVVNQNAQALNTLVKQLS 
SNFGAISSVLNDILSRLDKVEAEVQIDRLI 
TGRLQSLQTYVTQQLIRAAEI


49 


17-1189 


DRCTTFDDVQAPNYTQHTSSMRGVYYP 

DEIFRSDTLYLTQDLFLPFYSNVTGFHTI 

NHTFGNPVIPFKDGIYFAATEKSNVVRG 

WVFGSTMNNKSQSVIIINNSTNVVIRACN

FELCDNPFFAVSKPMGTQTHTMIFDNAF 

NCTFEYISDAFSLDVSEKSGNFKHLREFV 

FKNKDGFLYVYKGYQPIDWRDLPSGFN 

TLKPIFKLPLGINITNFRAILTAFSPAQDIW 

GTSAAAYFVGYLKPTTFMLKYDENGTIT 

DAVDCSQNPLAELKCSVKSFEIDKGIYQ 

TSNFRWPSGDWRFPNITNLCPFGEVFN 

ATKFPSVYAWERKKISNCVADYSVLYNS 

TFFSTFKCYGVSATKLNDLCFSNVYADS 

FWKGDDVRQIAPGQTGVIADYNYKLPD 

DFMGCVLAWNTRNIDATSTGNYNYKYR

YLRHGKLRPFERDISNVPFSPDGKPCTPP 

ALNCYWPLNDYGFYTTTGIGYQPYRW 

VLSFELLNAPATVCGPKLSTDLIKNQCV 

NFNFNGLTGTGVLTPSSKRFQPFQQFGR 

DVSDFTDSVRDPKTSEILDISPCAFGGVS 

VITPGTNASSEVAVLYQDVNCTDVSTAI 

HADQLTPAWRIYSTGNNVFQTQAGCLIG 

AEHVDTSYECDIPIGAGICASYHTVSLLR 

STSQKSIVAYTMSLGADSSIAYSNNTIAIP 

TNFSISITTEVMPVSMAKTSVDCNMYICG 

DSTECANLLLQYGSFCTQLNRALSGIAA 

EQDRNTREVFAQVKQMYKTPTLKYFGG 

FNFSQILPDPLKPTKRSFIEDLLFNKVTLA 

DAGFMKQYGECLGDINARDLICAQKFN 

GLTVLPPLLTDDMIAAYTAALVSGTATA 

GWTFGAGAALQIPFAMQMAYRFNGIGV 

TQNVLYENQKQIANQFNKAISQIQESLTT 

TSTALGKLQDWNQNAQALNTLVKQLS 

SNFGAISSVLNDILSRLDKVEAEVQIDRLI 

TGRLQSLQTYVTQQLIRAAEIRASANLA 

ATKMSECVLGQSKRVDFCGKGYHLMSF 

PQAAPHGVVFLHVTYVPSQERNFTTAPA

ICHEGKAYFPREGVFVFNGTSWFITQRNF

FSPQIITTDNTFVSGNCDVVIGIINNTVYD
PLQPELDSFKEELDKYFKNHTSPDVDLG
DISGINASVVNIQKEIDRLNEVAKNLNES
LIDLQELGKYEQ


50 


17-276 


DRCTTFDDVQAPNYTQHTSSMRGVYYP 

DEIFRSDTLYLTQDLFLPFYSNVTGFHTI 

NHTFGNPVIPFKDGIYFAATEKSNVVRG 

WVFGSTMNNKSQSVIIINNSTNVVIRACN

FELCDNPFFAVSKPMGTQTHTMIFDNAF

NCTFEYISDAFSLDVSEKSGNFKHLREFV

FKNKDGFLYVYKGYQPIDVVRDLPSGFN

TLKPIFKLPLGINITNFRAILTAFSPAQDIW

GTSAAAYFVGYLKPTTFMLKYDENGTIT 

DAV 


51 


17-446 


DRCTTFDDVQAPNYTQHTSSMRGVYYP 

DEIFRSDTLYLTQDLFLPFYSNVTGFHTI 

NHTFGNPVIPFKDGIYFAATEKSNVVRG

WVFGSTMNNKSQSVIIINNSTNVVIRACN

FELCDNPFFAVSKPMGTQTHTMIFDNAF 

NCTFEYISDAFSLDVSEKSGNFKHLREFV 

FKNKDGFLYVYKGYQPIDWRDLPSGFN 

TLKPIFKLPLGINITNFRAILTAFSPAQDIW 

GTSAAAYFVGYLKPTTFMLKYDENGTIT 

DAVDCSQNPLAELKCSVKSFEIDKGIYQ

TSNFRVVPSGDVVRFPNITNLCPFGEVFN

ATKFPSVYAWERKKISNCVADYSVLYNS

TFFSTFKCYGVSATKLNDLCFSNVYADS 

FWKGDDVRQIAPGQTGVIADYNYKLPD 

DFMGCVLAWNTRNIDATSTGNYNYKYR 

YLRHG


52


17-537


DRCTTFDDVQAPNYTQHTSSMRGVYYP 

DEIFRSDTLYLTQDLFLPFYSNVTGFHTI 

NHTFGNPVIPFKDGIYFAATEKSNVVRG 

WVFGSTMNNKSQSVIIINNSTNWIRACN 

FELCDNPFFAVSKPMGTQTHTMIFDNAF 

NCTFEYISDAFSLDVSEKSGNFKHLREFV 

FKNKDGFLYVYKGYQPIDWRDLPSGFN 

TLKPIFKLPLGINITNFRAILTAFSPAQDIW 

GTSAAAYFVGYLKPTTFMLKYDENGTIT 

DAVDCSQNPLAELKCSVKSFEIDKGIYQ 

TSNFRWPSGDVVRFPNITNLCPFGEVFN 

ATKFPSVYAWERKKISNCVADYSVLYNS 

TFFSTFKCYGVSATKLNDLCFSNVYADS 

FWKGDDVRQIAPGQTGVIADYNYKLPD 

DFMGCVLAWNTRNIDATSTGNYNYKYR

YLRHGKLRPFERDISNVPFSPDGKPCTPP 

ALNCYWPLNDYGFYTTTGIGYQPYRW 

VLSFELLNAPATVCGPKLSTDLIKNQCV 

NFNFNGLTGTGV


53 


17-757 plus an 
N-terminal 
mouse K chain 
leader sequence 
and a C- 
terminal myc
epitope and a 
poly histidine 
tag 


METDTLLLWVLLLWVPGSTGDDRCTTF 

DDVQAPNYTQHTSSMRGVYYPDEIFRSD 

TLYLTQDLFLPFYSNVTGFHTINHTFGNP 

VIPFKDGIYFAATEKSNWRGWVFGSTM 

NNKSQSVIIINNSTNWIRACNFELCDNPF 

FAVSKPMGTQTHTMIFDNAFNCTFEYIS 

DAFSLDVSEKSGNFKHLREFVFKNKDGF 

LYVYKGYQPIDWRDLPSGFNTLKPIFKL 

PLGINITNFRAILTAFSPAQDIWGTSAAA

YFVGYLKPTTFMLKYDENGTITDAVDCS

QNPLAELKCSVKSFEIDKGIYQTSNFRVV

PSGDWRFPNITNLCPFGEVFNATKFPSV 

YAWERKKISNCVADYSVLYNSTFFSTFK 

CYGVSATKLNDLCFSNVYADSFWKGD 

DVRQIAPGQTGVIADYNYKLPDDFMGC

VLAWNTRNIDATSTGNYNYKYRYLRHG 

KLRPFERDISNVPFSPDGKPCTPPALNCY 

WPLNDYGFYTTTGIGYQPYRVWLSFEL 

LNAPATVCGPKLSTDLIKNQCVNFNFNG

LTGTGVLTPSSKRFQPFQQFGRDVSDFT 

DSVRDPKTSEILDISPCAFGGVSVITPGTN 

ASSEVAVLYQDVNCTDVSTAIHADQLTP 

AWRIYSTGNNVFQTQAGCLIGAEHVDTS 

YECDIPIGAGICASYHTVSLLRSTSQKSIV

AYTMSLGADSSIAYSNNTIAIPTNFSISITT 
EVMPVSMAKTSVDCNMYICGDSTECAN 
LLLQYGSFCTQLNRALSGIAAEQEQKLIS 
EEDLHHHHHH 


54 


17-276 plus an 
N-terminal
mouse K chain
leader sequence
and a C-
terminal myc 
epitope and a 
poly histidine 
tag 


METDTLLLWVLLLWVPGSTGDDRCTTF 

DDVQAPNYTQHTSSMRGVYYPDEIFRSD 

TLYLTQDLFLPFYSNVTGFHTINHTFGNP 

VIPFKDGIYFAATEKSNWRGWVFGSTM 

NNKSQSVIIINNSTNVVIRACNFELCDNPF

FAVSKPMGTQTHTMIFDNAFNCTFEYIS 

DAFSLDVSEKSGNFKHLREFVFKNKDGF 

LYVYKGYQPIDWRDLPSGFNTLKPIFKL 

PLGINITNFRAILTAFSPAQDIWGTSAAA 

YFVGYLKPTTFMLKYDENGTITDAVEQK 

LISEEDLHHHHHH 





55 


17-537 plus an 
N-terminal 
mouse K chain 
leader sequence 
and a C- 
terminal myc 
epitope and a 
poly histidine 
tag 


METDTLLLWVLLLWVPGSTGDDRCTTF 

DDVQAPNYTQHTSSMRGVYYPDEIFRSD 

TLYLTQDLFLPFYSNVTGFHTINHTFGNP 

VIPFKDGIYFAATEKSNWRGWVFGSTM 

NNKSQSVIIINNSTNVVIRACNFELCDNPF

FAVSKPMGTQTHTMIFDNAFNCTFEYIS 

DAFSLDVSEKSGNFKHLREFVFKNKDGF 

LYVYKGYQPIDWRDLPSGFNTLKPIFKL 

PLGINITNFRAILTAFSPAQDIWGTSAAA 

YFVGYLKPTTFMLKYDENGTITDAVDCS 

QNPLAELKCSVKSFEIDKGIYQTSNFRW 

PSGDWRFPNITNLCPFGEVFNATKFPSV 

YAWERKKISNCVADYSVLYNSTFFSTFK 

CYGVSATKLNDLCFSNVYADSFWKGD 

DVRQIAPGQTGVIADYNYKLPDDFMGC

VLAWNTRNIDATSTGNYNYKYRYLRHG

KLRPFERDISNVPFSPDGKPCTPPALNCY 

WPLNDYGFYTTTGIGYQPYRVVVLSFEL 

LNAPATVCGPKLSTDLIKNQCVNFNFNG 

LTGTGVEQKLISEEDLHHHHHH


56 


17-756 

N-terminal 
without a signal 
peptide 


DRCTTFDDVQAPNYTQHTSSMRGVYYP 

DEIFRSDTLYLTQDLFLPFYSNVTGFHTI 

NHTFGNPVIPFKDGIYFAATEKSNVVRG

WVFGSTMNNKSQSVIIINNSTNVVIRACN

FELCDNPFFAVSKPMGTQTHTMIFDNAF 

NCTFEYISDAFSLDVSEKSGNFKHLREFV 

FKNKDGFLYVYKGYQPIDWRDLPSGFN 

TLKPIFKLPLGINITNFRAILTAFSPAQDIW

GTSAAAYFVGYLKPTTFMLKYDENGTIT 

DAVDCSQNPLAELKCSVKSFEIDKGIYQ 

TSNFRWPSGDWRFPNITNLCPFGEVFN 

ATKFPSVYAWERKKISNCVADYSVLYNS 

TFFSTFKCYGVSATKLNDLCFSNVYADS 

FWKGDDVRQIAPGQTGVIADYNYKLPD 

DFMGCVLAWNTRNIDATSTGNYNYKYR 

YLRHGKLRPFERDISNVPFSPDGKPCTPP 

ALNCYWPLNDYGFYTTTGIGYQPYRVV 

VLSFELLNAPATVCGPKLSTDLIKNQCV 

NFNFNGLTGTGVLTPSSKRFQPFQQFGR 

DVSDFTDSVRDPKTSEILDISPCAFGGVS 

VITPGTNASSEVAVLYQDVNCTDVSTAI 

HADQLTPAWRIYSTGNNVFQTQAGCLIG 

AEHVDTSYECDIPIGAGICASYHTVSLLR

STSQKSIVAYTMSLGADSSIAYSNNTIAIP 

TNFSISITTEVMPVSMAKTSVDCNMYICG 

DSTECANLLLQYGSFCTQLNRALSGIAA 

E 


57 


272-537 


ITDAVDCSQNPLAELKCSVKSFEIDKGIY 

QTSNFRVVPSGDVVRFPNITNLCPFGEVFN

ATKFPSVYAWERKKISNCVADYSVLYNS

TFFSTFKCYGVSATKLNDLCFSNVYADS

FVVKGDDVRQIAPGQTGVIADYNYKLPD

DFMGCVLAWNTRNIDATSTGNYNYKYR

YLRHGKLRPFERDISNVPFSPDGKPCTPP

ALNCYWPLNDYGFYTTTGIGYQPYRVV

VLSFELLNAPATVCGPKLSTDLIKNQCV 

NFNFNGLTGTGV 


58 


24-39 

D24 peptide 


DVQAPNYTQHTSSMRGC


59 


540-555 
P540 peptide 


PSSKRFQPFQQFGRDC


60 


1-16 
spike 

signal sequence 


MFIFLLFLTLTSGSDL


61 


303-537 

containing the 
receptor binding 
domain 


SNFRVVPSGDVVRFPNITNLCPFGEVFNA

TKFPSVYAWERKKISNCVADYSVLYNST

FFSTFKCYGVSATKLNDLCFSNVYADSF

VVKGDDVRQIAPGQTGVIADYNYKLPD

DFMGCVLAWNTRNIDATSTGNYNYKYR

YLRHGKLRPFERDISNVPFSPDGKPCTPP

ALNCYWPLNDYGFYTTTGIGYQPYRVV

VLSFELLNAPATVCGPKLSTDLIKNQCV 

NFNFNGLTGTGV 


62 


319-517 
containing the 
receptor binding 
domain 


ITNLCPFGEVFNATKFPSVYAWERKKISN 

CVADYSVLYNSTFFSTFKCYGVSATKLN 

DLCFSNVYADSFVVKGDDVRQIAPGQT 

GVIADYNYKLPDDFMGCVLAWNTRNID 

ATSTGNYNYKYRYLRHGKLRPFERDISN 

VPFSPDGKPCTPPALNCYWPLNDYGFYT 

TTGIGYQPYRWVLSFELLNAPATVCGP 

KLST 


63 


319-518 
containing the 
receptor binding 
domain 


ITNLCPFGEVFNATKFPSVYAWERKKISN

CVADYSVLYNSTFFSTFKCYGVSATKLN

DLCFSNVYADSFVVKGDDVRQIAPGQT

GVIADYNYKLPDDFMGCVLAWNTRNID

ATSTGNYNYKYRYLRHGKLRPFERDISN 

VPFSPDGKPCTPPALNCYWPLNDYGFYT 

TTGIGYQPYRWVLSFELLNAPATVCGP 

KLSTD 
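
The fragments in Table 1 are defined by 1-based, inclusive residue positions relative to SEQ ID NO: 1. As a minimal sketch of that convention (not part of the original disclosure), the Python below extracts a fragment from a parent sequence. The function name `fragment` is hypothetical, and only the first 40 residues of SEQ ID NO: 1, as they appear in the table rows that start at residue 1, are used for the demonstration.

```python
def fragment(full_sequence, start, end):
    """Return residues start..end (1-based, inclusive) of full_sequence."""
    if not (1 <= start <= end <= len(full_sequence)):
        raise ValueError("positions fall outside the parent sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return full_sequence[start - 1:end]


# First 40 residues of SEQ ID NO: 1 (enough for the two short examples below).
S_N_TERMINUS = "MFIFLLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSMRGV"

signal_sequence = fragment(S_N_TERMINUS, 1, 16)   # SEQ ID NO: 60 region
d24_region = fragment(S_N_TERMINUS, 24, 39)       # D24 peptide region

print(signal_sequence)
print(d24_region)
```

The D24 peptide listed in the table (SEQ ID NO: 58) carries one additional C-terminal cysteine that is not part of residues 24-39, so the slice above corresponds to the listed peptide without its final residue.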



Example 6

Structure of the Spike Protein
To characterize the properties and function of the SARS-CoV S protein,
nucleic acids encoding the full-length S protein of the Tor2 isolate were cloned
into expression vectors as described above. The Tor2 isolate is further described
in Marra et al., The genome sequence of the SARS-associated coronavirus,
Science 300:1399-1404 (2003). Clones generated included the full-length S
protein (1255 residues), the ectodomain Se (residues 17-1189) having just the
extracellular domain of the S protein with the putative transmembrane domain and
cytoplasmic tail of the spike protein deleted, fragments containing the
N-terminal 276 (SEQ ID NO:50), 537 (SEQ ID NO:52), and 756 (SEQ ID NO:56)
amino acid residues (S276, S537, and S756, respectively) including a putative
16-residue signal sequence or a mouse κ chain leader sequence, and an internal
fragment (S272-537) containing residues 272-537 (SEQ ID NO:57) (see Fig.
1B).

Amino acid residues 758-761 (RNTR) form part of the following general
motif for cleavage by precursor convertases:

K/R-Xn-K/R
where X is any amino acid residue and n = 0, 2, 4 or 6.
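
The motif above can be turned into a simple scan over a protein sequence. The Python sketch below is an illustration under stated assumptions, not a cleavage-site predictor: it reports every position where a K or R is followed, after exactly 0, 2, 4 or 6 arbitrary residues, by another K or R. The function name `convertase_motifs` is hypothetical, and the test string is the short stretch around RNTR as it appears in Table 1, so positions are reported relative to that string rather than to SEQ ID NO: 1.

```python
def convertase_motifs(seq, spacings=(0, 2, 4, 6)):
    """Find K/R-Xn-K/R matches; return (1-based start, n, matched text)."""
    hits = []
    for i, residue in enumerate(seq):
        if residue not in "KR":
            continue
        for n in spacings:
            j = i + n + 1  # index of the residue that closes the motif
            if j < len(seq) and seq[j] in "KR":
                hits.append((i + 1, n, seq[i:j + 1]))
    return hits


# Stretch surrounding the RNTR residues, taken from the table rows above.
region = "AAEQDRNTREVFAQ"
for start, n, text in convertase_motifs(region):
    print(f"start {start} in region, n = {n}: {text}")
```

For this stretch the only match is RNTR with n = 2, in line with the statement that residues 758-761 fit the general motif. A match alone does not show that cleavage occurs.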

The S1 subunit is approximately encompassed within the S756 fragment.
This finding is in agreement with the size of the S1 subunit for murine
coronaviruses, e.g., strain JHM where S1 is 769 residues, and for the human
coronavirus OC43 (778 residues). See Gallagher & Buchmeier, Coronavirus
spike proteins in viral entry and pathogenesis, Virology 279: 371-374 (2001);
Kunkel & Herrler, Structural and functional analysis of the surface protein of
human coronavirus OC43, Virology 195: 195-202 (1993). However, for
the human coronavirus 229E, S1 is considered to consist of a shorter 547 residue
fragment that corresponds to S537. Bonavia et al., Identification of a
receptor-binding domain of the spike glycoprotein of human coronavirus
HCoV-229E, J. Virol. 77: 2530-2538 (2003).

All S glycoprotein fragments and the full-length S glycoprotein ran on
SDS-PAGE gels at positions significantly higher than their estimated molecular
weights, indicating that these polypeptides are likely post-translationally
modified. The S276 polypeptide had an apparent molecular weight of about 75
kDa, S537 had an apparent molecular weight of about 100-110 kDa, S756 had
an apparent molecular weight of about 130-140 kDa, and Se and S had apparent
molecular weights of about 200 kDa or higher (Figs. 4 and 6). The bands
corresponding to these polypeptides were broad even when observed at low
exposure (Fig. 6; some data not shown). These data indicate significant
glycosylation as observed for the S glycoprotein and fragments thereof. Based
on approximate estimations of molecular weight it appears that the S2 subunit is
not as heavily glycosylated as S756 (constituting the S1 subunit). Notably, S276
is heavily glycosylated if one assumes that only glycosylation contributes to the
increased molecular mass.

Most of the SARS-CoV S glycoprotein obtained from cell culture
supernatants was not cleaved, although weak bands due to smaller proteins were
observed on SDS-PAGE gels. One of these weak bands runs at the same
position as S756, suggesting the possibility of inefficient cleavage (Figs. 4 and
6). Random digestion by proteases may occur and further studies are needed to
determine if the S glycoprotein cleavage is necessary for its function.
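
The comparison between estimated and apparent molecular weights made in this example can be reproduced as rough arithmetic. The Python sketch below is a hedged illustration, not a calculation taken from the application. It assumes an average residue mass of about 110 Da (a common rule of thumb), counts only residues 17 to the fragment end, ignores the myc and histidine tags, and uses the midpoints of the reported apparent ranges for S537 and S756. All names are hypothetical.

```python
# Rule-of-thumb average mass of one amino acid residue, in kDa (assumption).
AVG_RESIDUE_KDA = 0.110


def backbone_kda(n_residues):
    """Approximate mass of the unmodified polypeptide chain."""
    return n_residues * AVG_RESIDUE_KDA


def excess_kda(n_residues, apparent_kda):
    """Apparent mass on the gel minus the estimated polypeptide mass."""
    return apparent_kda - backbone_kda(n_residues)


# (number of residues from position 17 to the fragment end, apparent kDa)
FRAGMENTS = {
    "S276": (276 - 17 + 1, 75.0),    # reported as about 75 kDa
    "S537": (537 - 17 + 1, 105.0),   # midpoint of about 100-110 kDa
    "S756": (756 - 17 + 1, 135.0),   # midpoint of about 130-140 kDa
}

for name, (n, apparent) in FRAGMENTS.items():
    print(
        f"{name}: backbone ~{backbone_kda(n):.1f} kDa, "
        f"apparent ~{apparent:.0f} kDa, excess ~{excess_kda(n, apparent):.1f} kDa"
    )
```

Under these assumptions the excess for S276 is larger than its estimated polypeptide mass, which is consistent with the observation that S276 appears heavily glycosylated if glycosylation alone accounts for the increase.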



Example 7

Expression of peptide fragments in Escherichia coli
A nucleic acid segment encoding a SEQ ID NO:51 peptide fragment
containing amino acid residues 17-446 of SEQ ID NO: 1 was cloned into the
pRSET vector (Invitrogen, San Diego, CA) to create the plasmid
pRSET-S(17-446). E. coli BL21(DE3) cells were transformed with
pRSET-S(17-446) and then induced with IPTG. The results of the induction are
shown in Fig. 2.

Example 8

Use of the T7 promoter to drive expression of a cloned
peptide fragment of the invention
Human 293 cells or monkey Vero E6 cells were grown to a density of
1.2X10^ cells/T25 flask (60 mm dish) in 5 ml of DMEM+10% FBS medium the
day prior to transfection. The cells were then transfected, using the Polyfect
(Qiagen) transfection kit according to the manufacturer's protocol, with
pSecTag2B constructs (6 µg each) containing inserts coding for the various
peptide fragments of the spike protein. These constructs were prepared as
described above.

After 4 hours of transfection, a vTF7.3 vaccinia virus carrying a T7
polymerase was used to infect the transfected cells at an MOI (multiplicity of
infection) of 20 (Fuerst et al., Proc. Natl. Acad. Sci. 93:11371 (1986)). This
procedure provided for the use of the T7 promoter in the pSecTag2B vector
instead of the CMV promoter, which is much weaker (Nussbaum et al., J. Virol.
68:5411 (1994)). After three hours of infection, 1.5 ml of fresh medium was
added to the cells and then the cells were transferred to a 31°C incubator. The
cells were incubated for an additional 24 hours, after which the culture medium
was collected. 
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No measurable cytopathicity was observed in cells transfected with any 
of the S nucleic acid constructs (data not shown), indicating that the full-length 
and soluble fragments of the S glycoprotein may not have significant cytotoxic 
effects. However, at higher levels of expression such effects are possible and 
5 formation of syncj^a as described below may lead to cell death. 

Example 9 

Spike-Specific Antibodies
New Zealand rabbits were immunized with 0.1 mg of various peptides
selected by a computer program for their immunogenicity. Serum from the
immunized rabbits was tested in ELISA and Western blot for reactivity. Sera
from rabbits immunized with two peptides exhibited the highest and specific
activity against the spike glycoprotein and were selected for further study.
Antibodies denoted D24 and P540 were elicited by the peptides
DVQAPNYTQH TSSMRGC (SEQ ID NO:58) and PSSKRFQPFQQFGRDC
(SEQ ID NO:59), respectively. Another anti-SARS-CoV S glycoprotein
polyclonal antibody, IMG-542, which recognizes amino acids 288-303 of the S
glycoprotein, was purchased from IMGENEX (San Diego, CA).

Example 10

Immunoprecipitation and Purification of Spike Polypeptides
Soluble spike polypeptide fragments were obtained from the Vero E6 or
293 cell culture medium. However, the full-length spike glycoprotein was
detected only in the cell lysate.

Medium from cells transfected with nucleic acids encoding various
soluble S fragments was collected and subjected to centrifugation at 1000g for
10 min to remove cellular debris. The cleared medium was incubated with either
Ni-NTA agarose beads (Qiagen, Valencia, CA) or an immunoprecipitating
antibody plus protein G-Sepharose beads (Sigma, St. Louis, MO) for 2 h at
4 °C. The beads were then mixed with an equal volume of SDS gel sample
buffer, boiled for 3 min, and subjected to gel analysis. For full-length S
glycoprotein, cells were lysed first in PBS supplemented with 1% NP-40 and
0.5 mM PMSF for 1 h at 4 °C, and centrifuged at 14,000 rpm in a table-top
Eppendorf centrifuge for 20 min. The cleared lysate was either
immunoprecipitated first or used directly in Western blotting.

Example 11 

Western blotting and slot blots

Cells expressing the S glycoprotein were lysed first with a PBS-based
NP40 lysis buffer as described above, and the debris was cleared by
centrifugation. For soluble S fragments the medium was collected and cleared as
described above. For slot blots, the cleared lysate or medium from supernatant
was used directly to blot the nitrocellulose membrane following the protocol
suggested by the manufacturer (Bio-Rad, Hercules, CA) and the membrane was
subjected to antibody detection as in conventional Western blotting. For
Western blotting, a monoclonal anti-c-Myc epitope antibody (Invitrogen,
Carlsbad, CA) or anti-spike protein rabbit polyclonal antibodies obtained by
immunization of rabbits with spike peptides were diluted in TBST buffer.
Antibodies were incubated with the membrane for 2 h, washed and then the
membrane was incubated with a secondary antibody conjugated with HRP for 1
h, washed four times (each time for 15 min), and then developed using the ECL
reagent (Pierce, Rockford, IL).

Example 12 

Cell-binding assay and ELISA 
Medium containing soluble S fragments was collected and cleared by
centrifugation. Vero E6 or other cells (5x10^) were incubated with 0.5 ml of
cleared medium containing soluble S fragments and 2 μg of anti-c-Myc epitope
antibody conjugated with HRP at 4 °C for 2 h. Cells were then washed three
times with ice-cold PBS and collected by centrifugation. The cell pellets were
incubated with ABTS substrate from Roche (Indianapolis, IN) at RT for 10 min,
the substrate was cleared by centrifugation, and the optical density at 405 nm
was measured. The result of the slot blot analysis is presented in Fig. 4 and
discussed in further detail below.

For ELISA, purified ACE2 (R&D, Minneapolis, MN) was adsorbed onto
Maxisorp ELISA plates in pH 9.6 buffer at a concentration of 100 ng per well.
Medium (150 μl) containing various soluble S fragments and 0.6 μg of
anti-c-Myc epitope antibodies conjugated with HRP were incubated in each well
at 37 °C for 2 h. Wells were washed and 60 μl of ABTS substrate was added to
each well. The optical density (OD405) was measured 20 min later.

Example 13

Fluorescent dye redistribution cell fusion assay
HeLa or 293T cells, transfected with plasmids encoding the S
glycoprotein, were loaded with Calcein AM (Molecular Probes), which is
converted within the cells to calcein green. The cells were incubated in medium
containing 1 μg/ml Calcein AM for 1 h at 37 °C and 5% CO2, and then washed
and re-suspended in fresh medium. Plated target cells, Vero E6, were stained
with CMAC (Molecular Probes) by incubation in 1 μg/ml CMAC in medium for
30 min at 37 °C and 5% CO2. The cells were then washed twice with medium,
incubated for 20 min in fresh medium, washed again, and then covered with 0.5
ml medium per well. The S-expressing cells, loaded with calcein, were added to
the target cells and incubated for 1, 2, or 4 h at 37 °C and 5% CO2. Fusion was
measured as the ratio between the cells that have double staining and the total
number of target cells in contact with an S glycoprotein-expressing cell.
Microphotographs were taken using the MetaMorph 4.0 software from
Universal Imaging.
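
The fusion measure described above is a simple ratio, and it can be sketched as follows. This is only an illustrative calculation: the function name and the cell counts are hypothetical and are not taken from the experiments reported here.

```python
def fusion_fraction(double_stained: int, targets_in_contact: int) -> float:
    """Return the fraction of contacted target cells that carry both dyes.

    double_stained: target cells showing both calcein and CMAC staining.
    targets_in_contact: all target cells touching an S-expressing cell.
    """
    if targets_in_contact <= 0:
        raise ValueError("at least one target cell must be in contact")
    if not 0 <= double_stained <= targets_in_contact:
        raise ValueError("double-stained count must lie between 0 and the total")
    return double_stained / targets_in_contact


# Hypothetical counts for the 1, 2, and 4 h time points (illustration only).
counts = {1: (3, 120), 2: (9, 118), 4: (21, 125)}
for hours, (fused, contacted) in counts.items():
    print(f"{hours} h: {fusion_fraction(fused, contacted):.1%} fused")
```

Restricting the denominator to target cells in contact with an S-expressing cell keeps the ratio independent of how densely the effector cells happen to be plated.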



Example 14 

β-Galactosidase reporter gene-based cell-cell fusion assay
293T cells (1.5x10^) were plated in T25 flasks. The next day, these cells
were separately transfected with pCDNA3-S, pSectag2B-S, pCDNA3-ACE2,
and pCDNA3-ACE2-Ecto using the Polyfect transfection kit (Qiagen, Valencia,
CA) following the manufacturer's suggested protocol. Four hours after
transfection, cells transfected with S constructs were infected with T7
polymerase-expressing vaccinia virus VTF7.3 and cells transfected with ACE-2
constructs were infected with β-gal encoding vaccinia virus (VCB21R). Two
hours after infection, cells were incubated with fresh medium and transferred to
31 °C for overnight incubation. The next day S glycoprotein-expressing cells and
ACE-2-expressing cells were mixed in a 1:1 ratio and incubated at 37 °C. Three
hours later, cells were lysed by adding NP-40 to a final concentration of 0.5%.
Cell lysate (50 μl) was mixed with an equal volume of CPRG substrate and OD595
was measured 1 hr later.

Example 15 

Expression of Spike Polypeptides in Mammalian Cells

For certain experiments, all proteins except the full-length S glycoprotein
were tagged with a c-Myc epitope and a histidine tag. These proteins were
expressed in 293 and Vero E6 cells after transfection with the corresponding
plasmids followed by infection with vaccinia virus expressing T7 polymerase.

The tagged proteins were detected by using an anti-c-Myc monoclonal
antibody (Fig. 4). As shown in Fig. 4, the T7 promoter was a highly efficient
promoter for expression of the S glycoprotein. In these experiments, the T7
promoter gave rise to higher levels of expression than the CMV promoter, which
under most circumstances is a strong promoter (Fig. 4A). As shown in Fig. 4A,
the S fragments were soluble and their concentration in the culture supernatants
was inversely proportional to their size.

Example 16 

Anti-Spike Antibodies 

To be able to detect unlabeled proteins, validate the data obtained by the
anti-c-Myc antibody, and localize possible antigenic sites, rabbit polyclonal
antibodies were developed. Two of these antibodies, D24 and P540, were raised
against peptides starting at residues 24 and 540, respectively. The D24 and P540
antibody preparations specifically recognized certain soluble fragments (Fig.
4C). As expected, D24 recognized all fragments; P540 recognized S756, Se, and
S but not the smaller fragments (Fig. 4C; some data not shown). The D24
antibody preparation was relatively weak. However, the P540 preparation was
very sensitive even at a dilution of 1:10,000 and was used extensively in the
experiments described herein.

The P540 antibody preparation was used to detect whether the S
glycoprotein was expressed intracellularly, extracellularly or on the cell surface.
As shown in Fig. 5, the full-length S glycoprotein was expressed at the cell
surface, although at low levels, as measured by flow cytometry.

Example 17

Spike Protein Mediates Cell Fusion
The full-length S glycoprotein mediates fusion at neutral pH with cells
expressing receptor molecules. Cell-cell fusion assays were performed to
confirm that the full-length recombinant S glycoprotein was functional, and to
ascertain whether the S protein requires other viral proteins and/or low pH for its
fusion activity.

Expression of the full-length S glycoprotein with either vector, pCDNA3-S
or pSectag2B-S, supported fusion with ACE2-expressing cells efficiently, as
evidenced by formation of syncytia of various sizes and by the β-gal reporter
gene-based assay (Fig. 7). Interestingly, the pSectag2B-S construct, in which the S
glycoprotein leader peptide was replaced by a mouse κ chain leader sequence,
induced faster formation of syncytia. Moreover, the syncytia formed were larger
and more numerous than those induced by pCDNA3-S, which encodes the native
S glycoprotein (data not shown). The extent of fusion mediated by S expressed
from pSectag2B-S was also higher than from pCDNA3-S as measured by a
reporter gene-based assay (Fig. 7B). These data indicate that the natural S
glycoprotein may not be efficiently transported to the cell surface. These studies
also suggest that the β-gal assay described here can serve as a quick and
quantitative method to identify inhibitors of SARS-CoV entry into cells, as well
as a tool to study the SARS-CoV entry mechanism.

Notably, fusion of Vero E6 cells was not detected using the β-gal assay
or the syncytium formation assay when the cells were not transfected with
plasmids encoding ACE2 and the cells expressed only native concentrations of
the receptor. To explore the possibility that this was due to low sensitivity of
these two assays, another assay was used. This new assay was based on
fluorescent dye redistribution and is able to detect fusion of single cells. Even
with this fluorescence-based assay, statistically significant differences between
cells transfected with plasmids encoding the full-length S glycoprotein and
various negative controls were not detected. Some of the negative controls
included transfection with plasmids encoding soluble S fragments at different pH
(data not shown). Significant cell-cell fusion was only detected when the cells
were transfected with plasmids encoding ACE2, suggesting that the higher levels
of receptor expression achieved by expression of recombinant ACE2 could be
important for cell-cell fusion. Overall, these results suggest that recombinant S
glycoprotein can mediate cell fusion, that fusion can occur at neutral pH, and
that its efficiency is dependent on the concentration of the receptor molecules.

Moreover, soluble fragments of the S glycoprotein inhibit S-mediated
cell fusion. As shown in Fig. 15, addition of S fragments S272-537 and
S17-537, which have the receptor binding domain as described below, inhibits
S-mediated cell fusion. In this assay, the S272-537 (SEQ ID NO:57) fragment
exhibited the most inhibition. The S17-276 fragment, which does not have the
receptor binding domain, exhibited little or no inhibition of S-mediated cell
fusion. These data indicate that S polypeptide fragments that have the receptor
binding domain could inhibit SARS-CoV fusion with animal cells, thereby
inhibiting or preventing SARS-CoV infection.

Hence, blocking, modulating or inhibiting the activity of the spike
protein receptor binding domain, with an anti-RBD antibody, S polypeptide, S
peptide or aptamer may be an effective preventive or treatment for SARS-CoV
infection.



Example 18 

Identification of Spike Protein Receptor-Binding Domain

This Example illustrates that the Spike protein receptor-binding domain
is localized within residues 272 to 537 (SEQ ID NO:57), and likely within
residues 303-537 (SEQ ID NO:61). Later experiments have shown that a
fragment containing residues 319-517 (SEQ ID NO:62) also has receptor
binding activity.

An assay based on the binding of various soluble fragments to receptor
expressing Vero E6 cells was developed to localize the receptor-binding domain
(RBD) of the S glycoprotein. This assay involved measurement of fluorescence
associated with binding of antibodies directed against the S polypeptides to Vero
E6 cells and was developed prior to the identification of the SARS-CoV
receptor. Vero E6 cells that are susceptible to SARS-CoV infection were
incubated with full-length S polypeptide and various soluble S fragments.
Several cell lines that are not susceptible to SARS-CoV infection were similarly
incubated with full-length S polypeptides and soluble fragments thereof.

As shown in Figs. 8A and 8B, all S fragments bound to Vero
E6 cells except the smallest S fragment (S276). No such binding was
detected when several cell lines that are not susceptible to SARS-CoV infection
were incubated with full-length S polypeptides and soluble fragments thereof.
Binding to Vero E6 cells was proportional to the expression levels of the
fragments and was approximately inversely proportional to the sizes of the
fragments. These findings suggested that the RBD is localized between residues
272 and 537.

To further localize the RBD, an antibody (IMG 542) was used that was
generated using a peptide containing residues 288-303. This antibody did not
inhibit binding of the S537 fragment to Vero E6 cells although it did bind to the
S537 fragment (Fig. 8B; some data not shown), suggesting that the RBD is
localized between residues 303 and 537. Because of the relatively large antibody
size and the possibility for steric hindrance, it is likely that the RBD is
downstream of residue 303. Recently, the RBD of the HCoV-229E was
localized to a fragment containing amino acid residues 407-547. Ksiazek et al.
A novel coronavirus associated with severe acute respiratory syndrome, N.
Engl. J. Med. 348: 1953-1966 (2003); Rota et al. Characterization of a novel
coronavirus associated with severe acute respiratory syndrome. Science 300:
1394-1399 (2003). In contrast, the RBD for murine hepatitis virus was mapped
to the N-terminal 330 amino acids.

It remains to be seen whether there is structural similarity between the
RBD-containing fragments of the SARS-CoV S1 glycoprotein (e.g., S272-537)
and the HCoV-229E or hepatitis virus RBD, and whether such similarity is
related to the use of the same host for replication. These two viruses use
different receptors. The straightforward cell-binding approach described here
could also be helpful for identification of other virus receptors.

Recently, workers have reported the identification of ACE2 as a
functional receptor for the SARS-CoV. Li et al. Angiotensin-converting enzyme
2 is a functional receptor for the SARS coronavirus. Nature 426: 450-54 (2003).
The identification of ACE2 as receptor permitted further validation that the
results provided above are correct. As shown in Fig. 8C, when purified ACE2 is
used in an ELISA to test for binding, the same binding pattern was observed as
for the cell-binding assay. This was true for all of the S fragments tested (Fig.
8C).

The results provided herein not only offer new tools to study entry of the
SARS virus into cells, confirm that ACE2 is a receptor for the SARS-CoV S1
glycoprotein and localize the RBD but also facilitate development of novel
vaccine immunogens and therapeutics for prevention and treatment of SARS.

Example 19 

N-terminal and C-terminal Oligomerization of the S glycoprotein
This Example illustrates that the extreme N-terminal fragment of the S
glycoprotein, upstream from the RBD, may play a role in fusion, and the S
ectodomain forms trimers that could mediate fusion through six-helix bundle
intermediates.

Materials and methods

Antibodies and plasmids. The rabbit anti-S serum used in Western and
FACS analyses, P540, was developed by the inventors as described above. See
also, Xiao et al. Biochem. Biophys. Res. Comm. 312: 1159-65 (2003). The
anti-Myc epitope antibody was purchased from Invitrogen (Carlsbad, CA). The
anti-ACE2 goat polyclonal antibody was purchased from R&D Systems
(Minneapolis, MN) and used for detection by Western blotting.

Site directed mutagenesis was used to create the consensus cleavage sites
corresponding to that of the HIV-1 envelope glycoprotein (Env) and some
coronaviruses within the full length SARS-CoV S glycoprotein gene in
pCDNA3. The QuickChange Kit from Stratagene (La Jolla, CA) was employed
using the protocol provided by the manufacturer. For expression of various
N-terminal S fragments, the corresponding gene fragments were amplified by PCR
and cloned into the pSecTag2 expression vector (Invitrogen, Carlsbad). The
plasmid pCDNA3-ACE2-ecto, which expresses the ACE2 soluble ectodomain
tagged with C9 peptide, was kindly provided by Michael Farzan (Harvard
University, Boston MA).

Protein expression and purification. Various N-terminal fragments of the
S glycoprotein were sub-cloned in the pSecTag2 expression vector and transfected
into 293T cells followed by infection with VTF7.3 as described in Xiao et al.
Biochem. Biophys. Res. Comm. 312: 1159-65 (2003). The protein expressed
and secreted into the medium was purified using the HiTrap Ni2+-Chelating
column (Pharmacia) under native conditions. The purified protein was dialyzed
against PBS buffer and stored for further analysis.

S glycoprotein dimerization and its interaction with ACE2 examined by
co-immunoprecipitation. For S fragment dimerization, different S glycoprotein
constructs, alone or in combination, were transfected into 293T cells as described
in Xiao et al. Biochem. Biophys. Res. Comm. 312: 1159-65 (2003). Medium
containing S fragments was subjected to immunoprecipitation with rabbit anti-S
polyclonal antiserum P540. For some co-immunoprecipitation experiments,
DTT was added to create reducing conditions that eliminate inter-molecule
interactions through disulfide bonds. Immunoprecipitated S fragments were
detected by Western blotting using an anti-Myc epitope monoclonal antibody.
Soluble ACE2-C9 was expressed similarly. ACE2-C9 secreted into the medium was
used directly for incubation with various S fragments for 2 hours at 4°C.
Afterwards, ACE2 was immunoprecipitated by incubating with 1D4 anti-C9
monoclonal antibody and protein G-Sepharose beads at 4°C for one hour. The
beads were washed four times with PBS, suspended in SDS-PAGE sample
buffer, boiled for 3 min and subjected to gel separation. The presence of either
ACE2 or S in the sample was examined by Western blotting as described in Xiao
et al. Biochem. Biophys. Res. Comm. 312: 1159-65 (2003).

Flow cytometry. Cells transfected with full length S glycoprotein or S
glycoprotein with different N-terminal deletions and infected with VTF7.3 were
incubated with the P540 rabbit anti-S polyclonal antibody and goat anti-rabbit
antibody conjugated with FITC in PBS containing 1% BSA at 4°C for two
hours. Cells were then washed four times in ice cold PBS and analyzed with
FacsCalibur (Becton Dickinson, San Jose, California).

Gel filtration analysis of S fragments. After being purified on a Ni-chelating
column and buffer-exchanged to PBS, S fragment samples were loaded
onto a Superose 12 10/300 GL column (Pharmacia, Uppsala, Sweden) that had
been pre-equilibrated with PBS. The proteins were eluted with PBS at 0.5
ml/min, and 0.5 ml fractions were collected. The Superose 12 column was
calibrated with protein molecular mass standards of 669, 440, 232, 158, 67, 44
and 25 kD. A 10 μl aliquot was taken from each fraction for Western blot
analysis.
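
A column calibration of this kind is commonly turned into a size estimate by fitting log10(molecular mass) against elution volume. The sketch below illustrates that approach only; it is not stated to be the procedure used here. The standard masses are those listed above, but the elution volumes and all function names are hypothetical placeholders.

```python
import math


def fit_calibration(standards_kda, elution_ml):
    """Least-squares fit of log10(MW) = intercept + slope * elution volume."""
    xs = list(elution_ml)
    ys = [math.log10(mw) for mw in standards_kda]
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need matching lists with at least two standards")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def estimate_mw_kda(elution_ml, intercept, slope):
    """Convert an elution volume into an apparent molecular mass in kDa."""
    return 10 ** (intercept + slope * elution_ml)


# Standard masses from the text (kDa).
STANDARDS_KDA = [669, 440, 232, 158, 67, 44, 25]
# Hypothetical peak elution volumes (ml); NOT measured values from this work.
ELUTION_ML = [8.9, 9.6, 10.7, 11.4, 12.9, 13.6, 14.6]

intercept, slope = fit_calibration(STANDARDS_KDA, ELUTION_ML)
# With 0.5 ml fractions, fraction k elutes at roughly k * 0.5 ml.
fraction_number = 22
volume_ml = fraction_number * 0.5
print(f"fraction {fraction_number}: ~{estimate_mw_kda(volume_ml, intercept, slope):.0f} kDa")
```

Because glycosylated or non-globular proteins can elute earlier than their mass alone would predict, an estimate from such a fit is an apparent size rather than an exact mass.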

Crosslinking. Purified S537 fragment was diluted to a concentration of
0.2 μg/ml in PBS. BS3 (Pierce, Rockford, IL) was added to the S537 solution to
a final concentration of 1 mg/ml and incubated on ice for 1 min. The samples
were then mixed with an equal volume of 4X SDS-PAGE loading buffer and
analyzed by Western blot.

Cell fusion β-gal reporter gene assay. Cells transfected with
pSecTag2B-S or pCDNA3-ACE2 and infected with VTF7.3 and VCB21R
respectively were collected by trypsin digestion and washed once with PBS.
Cells were then suspended in regular DMEM medium at pH 7.4 and mixed.
Cells were lysed after four hours of incubation and β-gal activity was measured
using CPRG as the substrate (Roche) as described in Xiao et al. Biochem.
Biophys. Res. Comm. 312: 1159-65 (2003).

ELISA. Two ELISA assays were used. In the sandwich ELISA the plate
was coated with an anti-His tag antibody, then the S fragments were added and
detected with an anti-c-Myc epitope antibody. This assay was used for detection
of the S fragments. In the second ELISA assay the C9 tagged receptor ACE2
was coated on the plates through an anti-C9 antibody (1D4), and the S fragments
were added and, after washing, detected with an anti-c-Myc epitope antibody. In
all experiments the incubations with the c-Myc epitope antibody were for 2
hours at room temperature. The optical density (OD) was measured and
normalized to the highest value.
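
Normalizing to the highest value can be sketched as below. The sample names and OD readings are hypothetical and serve only to show the arithmetic; no background subtraction is assumed because none is described.

```python
def normalize_to_max(od_values: dict[str, float]) -> dict[str, float]:
    """Scale each OD reading so that the highest reading equals 1.0."""
    peak = max(od_values.values())
    if peak <= 0:
        raise ValueError("highest OD must be positive to normalize")
    return {name: od / peak for name, od in od_values.items()}


# Hypothetical OD405 readings (illustration only, not data from this work).
readings = {"S537": 1.20, "S756": 0.60, "S276": 0.06}
for name, value in normalize_to_max(readings).items():
    print(f"{name}: {value:.2f}")
```

Scaling to the plate maximum lets binding curves from different plates be compared on a common 0 to 1 axis, at the cost of discarding absolute signal.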



Results

The N-terminal fragment upstream of the RBD of the S glycoprotein
forms a dimer. It has been previously shown for another coronavirus (MHV)
that soluble S1 (similar to SU) fragments form dimers, that the extreme
N-terminal 330 amino acid residue region that contains the receptor binding
domain participates in the dimerization, and that only dimers bind the receptor
CEACAM. See Lewicki & Gallagher, J. Biol. Chem. 277:19727-34 (2002).
However, the inventors and others have localized the SARS-CoV receptor
binding domain downstream from the extreme N-terminus. Xiao et al. Biochem.
Biophys. Res. Comm. 312: 1159-65 (2003); Wong et al. J. Biol. Chem. 279:
3197-3201 (2004); Babcock et al. J. Virol. 78: 4552-4560 (2004).

To address the possibility of oligomerization by receptor binding
domain-containing fragments and to assess their function in mediating
membrane fusion, several S fragments were tested for oligomerization. These S
fragments included the extreme N-terminal fragment (residues 17 through 276
denoted as S276, SEQ ID NO:50) that does not bind the receptor ACE2, several
S fragments (S756, S537, S272-537) that bind ACE2, as well as a fragment
including residues 319 through 517 (denoted as S319-517, SEQ ID NO:62) that
retains receptor binding activity. These fragments were selected in part because
they fold independently and are secreted in the cell culture supernatant, although
the efficiency of their expression varied significantly (Fig. 9A, left) and their
concentration was decreased when co-expressed with S756 (Fig. 9A, right).

To find whether any of these fragments oligomerizes with the largest one
(S756), which includes the equivalent of the receptor-binding subunit of the
envelope glycoproteins (SU in general and S1 for coronaviruses), the polypeptide
fragments were coexpressed, and then the mixtures in the cell culture
supernatants were immunoprecipitated with the antibody P540. As described in
previous Examples, this rabbit polyclonal antibody preparation was developed
against a peptide containing residues 540-555 (SEQ ID NO:59) of the S
glycoprotein. The P540 antibody binds the S756 polypeptide but not the other
fragments (Fig. 9B, left). All N-terminal fragments except the smallest fragment
(S319-517) containing the receptor binding domain were coimmunoprecipitated
with S756 by P540 (Fig. 9B, right). To rule out the possibility of nonspecific
disulfide bond formation that may lead to coimmunoprecipitation, DTT was
included in one of the coimmunoprecipitation experiments. DTT had no effect
on either immunoprecipitation or coimmunoprecipitation of secreted S756 (left
lanes) or S756+S276 (right lanes) (Fig. 9C, left panel).

To find the size of the oligomers, one of the fragments (S537) was
cross-linked with BS3. The right panel of Fig. 9C shows the appearance of a new band
with a molecular weight corresponding to a dimer but not of higher order
oligomers. To exclude the possibility of artifacts due to cross-linking and further
to confirm the formation of dimers, the S537 fragment was also analyzed by gel
filtration. Two gel filtration elution peaks were observed: one due to species of
molecular weight of about 230 kDa and the other one of about 110 kDa (Fig.
10A, upper panel) corresponding to a dimer-sized oligomer and a monomer,
respectively. In contrast, the smallest fragment containing the receptor binding
domain (S319-517) was eluted only as a monomer at about 35 kDa molecular
weight (Fig. 2A, lower panel). Overall, these results suggest that soluble SU is a
dimer and that the dimerization domain is within the extreme N-terminal region
upstream from residue 317 and the receptor binding domain.
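
The assignment of the two S537 peaks to dimer and monomer follows from the ratio of their apparent masses. A minimal sketch of that arithmetic, using the approximate 230 kDa and 110 kDa values reported above (the function name is hypothetical):

```python
def oligomeric_state(complex_kda: float, monomer_kda: float) -> tuple[int, float]:
    """Return the nearest whole number of subunits and the raw mass ratio."""
    if complex_kda <= 0 or monomer_kda <= 0:
        raise ValueError("masses must be positive")
    ratio = complex_kda / monomer_kda
    return round(ratio), ratio


# Approximate apparent masses of the two S537 gel filtration peaks (kDa).
subunits, ratio = oligomeric_state(230, 110)
print(f"mass ratio {ratio:.2f} -> {subunits} subunits")
```

Gel filtration masses are approximate, so a ratio near 2 is consistent with a dimer but does not by itself exclude a small amount of other species.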

The dimeric N-terminal region is required for S-mediated cell-cell fusion.
Because the putative dimerization domain is upstream from the receptor binding
domain within S1 and the fusion machinery is in S2, one might hypothesize that
dimerization may not be required for mediation of fusion. To test this
hypothesis, two deletion mutants of the full-length S glycoprotein were
generated. The N-terminal 103 residues were deleted from one mutant and
the N-terminal 311 residues were deleted from another (Fig. 9A), thereby
eliminating the presumed dimerization domain. Neither mutant exhibited
any fusion activity, in contrast to the wild type full-length S glycoprotein
(Fig. 9A). To test whether a differential level of expression could account
for the lack of observable fusogenic activity, the surface and overall levels of
expression were measured by flow cytometry and Western blotting. The data
from both assays suggested that the level of expression of the two deletion
mutants is indistinguishable from that of the wild type (Figs. 11B and C). These
results suggest that the extreme N-terminus is required for fusion by a
mechanism that may or may not involve dimerization.

Dimeric S1 binds ACE2 much more efficiently than monomeric
fragments containing the receptor binding domain. Previous work with
another coronavirus (MHV) suggested that only dimeric S1 binds its receptor
CEACAM. Lewicki & Gallagher, J. Biol. Chem. 277:19727-734 (2002).
Experiments were conducted on SARS-CoV fragments to understand how the
dimeric state of the S1 may affect fusion. In particular, binding of S1 fragments
in monovalent and bivalent form to ACE2 was observed by using the anti-c-Myc
epitope antibody for conversion of monovalent S1 fragments into bivalent ones.
One of these S1 fragments (S319-517, SEQ ID NO:62) did not bind to any
measurable degree to surface-immobilized ACE2 unless bound by an anti-c-Myc
epitope antibody, which converted it into a bivalent molecule in solution before
and during incubation with the receptor (Fig. 12). In contrast, S537 bound to
ACE2 without the antibody although the antibody presence increased its binding
(Fig. 12). These results suggest that a dimeric state of S1 could contribute to an
increased overall affinity that may enhance fusion efficiency.

The soluble S ectodomain is a trimer. Viral envelope glycoproteins of
class I fusion proteins such as hemagglutinin (HA) of influenza are trimeric
through the transmembrane domain. Because the SARS-CoV S glycoprotein
was recently found to be a class I fusion protein, the S2 subunit may facilitate
trimerization of the whole S glycoprotein. However, a dimeric S1 with a
trimeric S2 could lead to higher order oligomers whose formation depends on
the availability of the dimerization binding site in the native S glycoprotein. To
test this possibility, the size of the soluble S ectodomain (Se), in which the
transmembrane domain and the cytoplasmic tail were deleted, was approximated
by gel filtration. As shown in Fig. 13, a complex having the approximate size of a
trimer (MW 512 kDa) was detected. No higher order oligomers were detected. These
results not only suggest that the Se fragment and perhaps the full-length
membrane-associated S are trimers in their native unbound state but also
indicate that the dimerization site in S1 is not readily available for intertrimer
interactions.

These results indicate the following: 1) the SU subunit of the SARS-CoV
S glycoprotein (S1) forms dimers, 2) the dimerization domain does not
overlap and is upstream of the receptor binding domain, 3) deletion of the
dimerization domain abolishes fusion, 4) dimeric S1 binds receptor molecules
much more efficiently than monovalent fragments containing the receptor
binding domain, and 5) the soluble S ectodomain forms trimers under gel
filtration conditions.

It has been previously reported that some SU subunits of class I fusion
proteins (that bind receptor molecules) can form dimers including, for example,
gp120 of the retrovirus HIV-1 and S1 of the coronavirus MHV. Center et al. J.
Virol. 74: 4448-55 (2000); Lewicki et al. J. Biol. Chem. 277: 19727-34 (2002).
Until the present work, the role of S1 dimerization for mediation of membrane
fusion was unclear. It is now generally accepted that soluble ectodomains such
as the gp140 protein of the HIV-1 and SIV envelope glycoproteins (Env) form
trimers although dimers and tetramers can be observed. Center et al. Proc. Nat'l
Acad. Sci. U.S.A. 98: 14877-82 (2001). Similarly, it appears that at least a
possible fusion intermediate quaternary structure of S2 of coronaviruses,
including the SARS-CoV, is trimeric. Liu et al. Lancet 363: 938-947 (2004);
Bosch et al. Proc. Nat'l Acad. Sci. U.S.A. 101: 8455-60 (2004). In contrast, some
data indicate that the MHV S2 protein is monomeric after dissociation from S1.
Lewicki et al. J. Biol. Chem. 277: 19727-34 (2002). Dimer-to-trimer transitions
play a critical role in the mechanism of fusion mediated by class II fusion
proteins. Thus it has been proposed that changes in the quaternary structure of
some coronaviruses may play a role in the fusion mechanism. Id. One should
note that both the HIV-1 Env and the MHV S glycoproteins are cleaved and the
SU can dissociate from the transmembrane subunit; however, such dissociation
may not be important for fusion. In contrast, the SARS-CoV S is not cleaved
when expressed in membrane associated or soluble form and cleavage may not
be required for fusion. Thus, although the SARS-CoV S glycoprotein is a class I
fusion protein, the lack of cleavage is an exception from the rule that the Envs of
class I fusion proteins are cleaved presumably to confer a metastable
high-energy state that could drive the fusion reaction.

This finding that the SU (S1) domain of the SARS-CoV S glycoprotein
can form dimers and also forms trimers with the ectodomain of the
transmembrane domain (S2) poses an interesting topological situation. Thus, if
two of the monomers within a trimer also form a dimer, then the third monomer
would still be free to interact with a "free" monomer from another trimer and
form a dimer of the two trimers. In another scenario the orientation of each of
the monomers in the trimer may not allow formation of dimers in the trimer but
leave "free" binding sites for dimerization with monomers from other trimers. In
this case one might expect the formation of a network of trimers. Finally, the
three-dimensional structure of the trimer may not allow any interactions of the
monomer dimerization sites with other monomers in the same or different trimer.
The latter possibility is supported by the preliminary data provided herein where
higher order oligomers were not detected using the described gel filtration
conditions. Under those conditions either intratrimer dimerization occurs but the
third monomer conformation does not allow interactions with monomers from
other trimers or such interactions are too weak to be detected, or the trimer three-
dimensional structure is such that it does not allow dimerization interactions.
Data provided herein demonstrate lack of fusion after deletion of portions
of the dimerization domain and indicate that the dimerization region may play a
role in fusion although its mechanism may not be through dimerization
interactions. In addition, under native conditions where the surface
concentration of the S glycoprotein can be very high, as seen in electron
micrographs, it is possible that dimerization interactions play a role in stabilizing
a "network" of interacting molecules perhaps somewhat similar to networks of
proteins that mediate entry of class II fusion proteins. Such networks, if any,
could increase the avidity of interaction with receptor molecules and perhaps
facilitate the formation of the fusion pore structure by providing a pre-assembled
network of Env molecules or even provide energy to drive the fusion reaction in
the absence of S cleavage that generates a high-energy metastable state.

Example 20 

Sera from Mice Immunized with DNA Encoding RBD Polypeptides
Inhibit S-Mediated Cell Fusion
This Example illustrates that immunizing mammals with DNA encoding 
receptor binding domain polypeptides may prevent SARS infection. 

Materials and Methods 

Mice were divided into three groups: group A (mice #1 through #5) was
immunized with plasmid pSecTag-SRBD, which encodes the S319-518
fragment that includes the receptor binding domain (RBD) of the spike protein;
group B (mice #1 to #5) was immunized with the plasmid pEAK-10-RBD-Fc,
which encodes a fusion protein of the RBD (S319-518) fragment fused to Fc; and
group C (mice #1 to #3) was immunized with a control plasmid. Five
BALB/c mice per group were immunized at day 0, day 14 and day 28. Mice
received less than 2 µg DNA per immunization with a gene gun. Sera were
collected at day 56. In Fig. 14A-B, the first number denotes an individual
mouse, the letter denotes the respective immunization group, and the last number
denotes the dilution used.

Cells (293T) were incubated with anti-sera from the immunized mice and 
then mixed with cells expressing S protein. Fusion was measured as described in 
previous Examples (see also, Xiao et al. BBRC 2003). PC denotes positive
control where no serum was added. For mice #1 to #2 in each group, serum
dilution factors of 10, 100, and 1000 were used. For mice #3-#5 in groups A and 
B, and #3 in the control group, dilution factors of 20 and 100 were used. 
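
The sampling design described above can be enumerated with a short sketch. This is only an illustration, not part of the experimental protocol: the group sizes and dilution factors are taken from the text, while the label format (for example "1A-10") and the function names are assumptions introduced here.

```python
# Group membership as stated above: groups A and B have mice #1-#5,
# the control group C has mice #1-#3.
GROUPS = {"A": range(1, 6), "B": range(1, 6), "C": range(1, 4)}


def dilutions_for(mouse: int) -> tuple:
    """Mice #1 and #2 of every group were tested at 10, 100 and 1000;
    all remaining mice were tested at 20 and 100."""
    return (10, 100, 1000) if mouse <= 2 else (20, 100)


def sample_labels() -> list:
    """List every (mouse, group, dilution) combination as a label.

    The "<mouse><group>-<dilution>" format is hypothetical; the text only
    says that the first number is the mouse, the letter is the group and
    the last number is the dilution.
    """
    labels = []
    for group, mice in GROUPS.items():
        for mouse in mice:
            for dilution in dilutions_for(mouse):
                labels.append(f"{mouse}{group}-{dilution}")
    return labels


if __name__ == "__main__":
    for label in sample_labels():
        print(label)
```

Under these assumptions the design yields twelve serum samples each for groups A and B and eight for the control group, in addition to the no-serum positive control (PC).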

Results

The antibody titers for the anti-sera obtained from the mice are shown in
Fig. 14A. As shown, mice immunized with DNA encoding the spike protein
receptor binding domain (S319-518, groups A and B) had very high titer
anti-sera: dilutions up to 1:7250 still reacted strongly to antigen in ELISA assays.

As shown in Fig. 14B, anti-sera from mice immunized with DNA
encoding the spike protein receptor binding domain inhibited fusion of cells that
express the S protein in a dose dependent manner. Thus, anti-sera from mice
1A and 2A, which were immunized with DNA encoding the S receptor binding
domain, substantially eliminated S-protein mediated cell fusion when used at a
1:10 dilution. Higher dilutions (1:100 and 1:1000) of these anti-sera were less
effective. Similar results were observed on cell fusion inhibited by anti-sera
from mouse 3A (1:20 dilution), from mouse 4A (1:20 dilution), and from mouse
5A (1:20 dilution).

These data indicate that immunizing mammals with DNA encoding S
protein receptor binding domain polypeptides can raise a strong immune
response against the spike protein and could prevent SARS infection. As
described above, soluble fragments of the S glycoprotein that have the receptor
binding domain inhibit S-mediated cell fusion (see Fig. 15).

All patents and publications referenced or mentioned herein are
indicative of the levels of skill of those skilled in the art to which the invention
pertains, and each such referenced patent or publication is hereby incorporated
by reference to the same extent as if it had been incorporated by reference in its
entirety individually or set forth herein in its entirety. Applicants reserve the
right to physically incorporate into this specification any and all materials and
information from any such cited patents or publications.

The specific methods and compositions described herein are
representative of preferred embodiments and are exemplary and not intended as
limitations on the scope of the invention. Other objects, aspects, and
embodiments will occur to those skilled in the art upon consideration of this
specification, and are encompassed within the spirit of the invention as defined
by the scope of the claims. It will be readily apparent to one skilled in the art
that varying substitutions and modifications may be made to the invention
disclosed herein without departing from the scope and spirit of the invention.
The invention illustratively described herein suitably may be practiced in the
absence of any element or elements, or limitation or limitations, which is not
specifically disclosed herein as essential. The methods and processes
illustratively described herein suitably may be practiced in differing orders of
steps, and they are not necessarily restricted to the orders of steps indicated
herein or in the claims. As used herein and in the appended claims, the singular
forms "a," "an," and "the" include plural reference unless the context clearly
dictates otherwise. Thus, for example, a reference to "a host cell" includes a
plurality (for example, a culture or population) of such host cells, and so forth.
Under no circumstances may the patent be interpreted to be limited to the
specific examples or embodiments or methods specifically disclosed herein.
Under no circumstances may the patent be interpreted to be limited by any
statement made by any Examiner or any other official or employee of the Patent
and Trademark Office unless such statement is specifically and without
qualification or reservation expressly adopted in a responsive writing by
Applicants.

The terms and expressions that have been employed are used as terms of
description and not of limitation, and there is no intent in the use of such terms
and expressions to exclude any equivalent of the features shown and described
or portions thereof, but it is recognized that various modifications are possible
within the scope of the invention as claimed. Thus, it will be understood that
although the present invention has been specifically disclosed by preferred
embodiments and optional features, modification and variation of the concepts
herein disclosed may be resorted to by those skilled in the art, and that such
modifications and variations are considered to be within the scope of this
invention as defined by the appended claims.

The invention has been described broadly and generically herein. Each
of the narrower species and subgeneric groupings falling within the generic
disclosure also forms part of the invention. This includes the generic description
of the invention with a proviso or negative limitation removing any subject
matter from the genus, regardless of whether or not the excised material is
specifically recited herein.

Other embodiments are within the following claims. In addition, where
features or aspects of the invention are described in terms of Markush groups,
those skilled in the art will recognize that the invention is also thereby described
in terms of any individual member or subgroup of members of the Markush
group.

WHAT IS CLAIMED:

1. A polypeptide fragment of SEQ ID NO: 1, or a conservative variant 
thereof, wherein the polypeptide can produce a humoral or cellular 
immune response when used to inoculate an animal. 

2. A polypeptide having any one of SEQ ID NOs: 13, 14, 15, 20-59, 61-63,
wherein the polypeptide can produce a humoral or cellular immune
response when used to inoculate an animal.

3. A polypeptide having any one of SEQ ID NOs: 13, 14, 15, 25, 34, 46, 51,
52, 56, 57, 58, 59, 61, 62 or 63, wherein the polypeptide can produce a
humoral or cellular immune response when used to inoculate an animal.

4. The polypeptide of claim 1, 2 or 3, wherein the polypeptide is soluble in
an aqueous solution.

5. The polypeptide of claim 1, 2 or 3, wherein the animal is a mammal. 

6. The polypeptide of claim 5, wherein the mammal is a human.

7. The polypeptide of claim 1, 2 or 3, wherein the polypeptide is amino- 
terminally or carboxyl-terminally blocked. 

8. A coupled protein comprising a carrier protein coupled to a second
polypeptide having any one of (a) SEQ ID NOs: 13, 14, 15, 20-59, 61-63;
(b) a peptide fragment of SEQ ID NO: 1, or a conservative variant of
(a) or (b).

9. The coupled protein of claim 8, wherein the carrier protein is soluble in
an aqueous solution.

10. The coupled protein of claim 9, wherein the carrier protein is selected
from the group consisting of bovine serum albumin, keyhole limpet
hemocyanin, ovalbumin, mouse serum albumin, and rabbit serum albumin.

11. The coupled protein of claim 8, wherein the coupled protein produces a
humoral or a cellular immune response when used to inoculate an animal.

12. The coupled protein of claim 11, wherein the animal is a mammal.

13. The coupled protein of claim 12, wherein the mammal is a human.

14. An immunopeptide comprising a polypeptide having any one of (a) SEQ
ID NOs: 13, 14, 15, 20-59, 61-63; or (b) a fragment of SEQ ID NO: 1;
coupled to arsanilic acid, sulfanilic acid, an acetyl group, or a picryl
group.

15. The immunopeptide of claim 14, wherein the immunopeptide produces a
humoral or a cellular immune response when used to inoculate an animal.

16. The immunopeptide of claim 15, wherein the animal is a mammal.

17. The immunopeptide of claim 16, wherein the mammal is a human.

18. An immune composition comprising an adjuvant and a polypeptide
having any one of SEQ ID NOs: 13, 14, 15, 20-59, 61-63; or a fragment
of SEQ ID NO: 1.

19. The immune composition of claim 18, wherein the adjuvant is selected
from the group consisting of aluminum hydroxide, lipid A, killed
bacteria, polysaccharide, mineral oil, Freund's incomplete adjuvant,
Freund's complete adjuvant, aluminum phosphate, iron, zinc, a calcium
salt, acylated tyrosine, an acylated sugar, a cationically derivatized
polysaccharide, an anionically derivatized polysaccharide, a
polyphosphazine, a biodegradable microsphere, a monophosphoryl lipid
A, and quil A.

20. The immune composition of claim 18, wherein the polypeptide is
amino-terminally or carboxyl-terminally blocked.

21. A peptidomimetic of an amino acid sequence having any one of (a) SEQ
ID NOs: 13, 14, 15, 20-59, 61-63; (b) a fragment of SEQ ID NO: 1, or a
conservative variant of (a) or (b).

22. An immune composition comprising an adjuvant and a peptidomimetic 
of an amino acid sequence having any one of (a) SEQ ID NOs: 13, 14, 
15, 20-59, 61-63; (b) a fragment of SEQ ID NO: 1; or a conservative 
variant of (a) or (b). 

23. The immune composition of claim 22, wherein the adjuvant is selected
from the group consisting of aluminum hydroxide, lipid A, killed
bacteria, polysaccharide, mineral oil, Freund's incomplete adjuvant,
Freund's complete adjuvant, aluminum phosphate, iron, zinc, a calcium
salt, acylated tyrosine, an acylated sugar, a cationically derivatized
polysaccharide, an anionically derivatized polysaccharide, a
polyphosphazine, a biodegradable microsphere, a monophosphoryl lipid
A, and quil A.

24. A nucleic acid segment that encodes a polypeptide having any one of (a)
SEQ ID NOs: 13, 14, 15, 20-59, 61-63; (b) a peptide that is a fragment of
SEQ ID NO: 1, or a conservative variant of (a) or (b).

25. An expression cassette comprising a promoter that is operably linked to a
nucleic acid segment that encodes a polypeptide having any one of (a)
SEQ ID NOs: 13, 14, 15, 20-59, 61-63; (b) a peptide that is a fragment of
SEQ ID NO: 1; or a conservative variant of (a) or (b).






WO 2005/010034



PCT/US2004/023345 



26. The expression cassette according to claim 25, wherein the promoter is a
constitutive promoter or a regulated promoter.

27. A nucleic acid construct comprising a vector and a nucleic acid segment
that encodes (a) a polypeptide having any one of SEQ ID NOs: 13, 14,
15, 20-59, 61-63; (b) a peptide that is a fragment of SEQ ID NO: 1; (c) a
conservative variant of (a) or (b); or an expression cassette according to
claim 25.

28. The nucleic acid construct according to claim 27, wherein the vector is
selected from the group consisting of a plasmid, a cosmid, a yeast 
artificial chromosome, a bacterial artificial chromosome, an F-factor, a 
virus, an expression vector, and a phagemid. 

29. A recombinant virus comprising a viral vector and a nucleic acid segment
that encodes (a) a polypeptide having any one of SEQ ID NOs: 13, 14,
15, 20-59, 61-63; (b) a peptide that is a fragment of SEQ ID NO: 1; (c) a
conservative variant of (a) or (b); or an expression cassette according to
claim 25.

30. The recombinant virus of claim 29, wherein the viral vector is selected
from the group consisting of vaccinia virus, canarypox, adenovirus, and
herpes virus.

31. A composition comprising a pharmaceutical carrier and (a) a polypeptide
having any one of SEQ ID NOs: 13, 14, 15, 20-59, 61-63; (b) a peptide
that is a fragment of SEQ ID NO: 1; or (c) a conservative variant of (a) or
(b).

32. The composition of claim 31, wherein the composition is formulated for
treatment of SARS-CoV.

33. The composition of claim 31, wherein the composition is formulated for
inhibition of SARS-CoV fusion with, or entry into, mammalian cells.

34. A composition comprising a pharmaceutical carrier and a nucleic acid
segment that encodes (a) a polypeptide having any one of SEQ ID NOs:
13, 14, 15, 20-59, 61-63; (b) a peptide that is a fragment of SEQ ID NO:
1; (c) a conservative variant of (a) or (b); or an expression cassette
according to claim 30.

35. The composition of claim 34, wherein the composition is formulated for 
treatment of SARS-CoV. 

36. The composition of claim 34, wherein the composition is formulated for 
prevention of SARS-CoV fusion with, or entry into, mammalian cells. 

37. A viral vaccine comprising a pharmaceutical carrier, a viral vector and a
nucleic acid segment that encodes (a) a polypeptide having any one of
SEQ ID NOs: 13, 14, 15, 20-59, 61-63; (b) a peptide that is a fragment of
SEQ ID NO: 1; (c) a conservative variant of (a) or (b); or an expression
cassette according to claim 30.

38. The viral vaccine according to claim 34, wherein the viral vaccine is
formulated in unit dosage form.

39. A peptide vaccine comprising a pharmaceutical carrier and (a) a peptide
having any one of SEQ ID NOs: 13, 14, 15, 20-59, 61-63; (b) a fragment
of SEQ ID NO: 1; (c) a peptidomimetic of (a) or (b); or (d) a
conservative variant of (a) or (b).

40. The peptide vaccine according to claim 39, wherein the peptide vaccine 
is formulated in unit dosage form. 

41. A microorganism vaccine comprising a pharmaceutical carrier and a
microorganism that expresses (a) a peptide having any one of SEQ ID
NOs: 13, 14, 15, 20-59, 61-63; (b) a fragment of SEQ ID NO: 1; or (c) a
conservative variant of (a) or (b).

42. The microorganism vaccine according to claim 41, wherein the
microorganism is selected from the group consisting of Salmonella and
Listeria monocytogenes.

43. The microorganism vaccine according to claim 42, wherein the
microorganism vaccine is formulated in unit dosage form.

44. A DNA vaccine comprising a pharmaceutical carrier and vector into
which is inserted a nucleic acid segment that encodes (a) an amino acid
sequence as put forth in any one of SEQ ID NOs: 13, 14, 15, 20-59, 61-
63; (b) a fragment of SEQ ID NO: 1; or (c) a conservative variant of (a)
or (b).

45. The DNA vaccine according to claim 44, wherein the vector is selected
from the group consisting of a plasmid, a cosmid, a yeast artificial
chromosome, a bacterial artificial chromosome, an F-factor, a virus, and
a phagemid.

46. The DNA vaccine according to claim 44, wherein the DNA vaccine is
formulated in unit dosage form.

47. The DNA vaccine according to claim 46, wherein the DNA vaccine
further comprises a myonecrotic agent.

48. The DNA vaccine according to claim 47, wherein the myonecrotic agent
is bupivacaine or cardiotoxin.

49. An antibody that binds to an amino acid sequence as set forth in any one
of SEQ ID NOs: 13, 14, 15, 20-59, 61-63; or a fragment of SEQ ID NO: 1.

50. The antibody according to claim 49, wherein the antibody specifically
binds to an amino acid sequence as set forth in any one of SEQ ID NOs:
13, 14, 15, 20-59, 61-63; or a fragment of SEQ ID NO: 1.

51. The antibody according to claim 49, wherein the antibody specifically
binds to a S protein receptor binding domain.

52. The antibody according to claim 49, wherein the antibody is a
monoclonal antibody, a polyclonal antibody, a single-chain antibody, an
antigen-binding antibody fragment, or a humanized antibody.

53. The antibody according to claim 52, wherein the antigen-binding
antibody fragment is an scFv, Fv, Fab', Fab, diabody, linear antibody or
F(ab')2.

54. The antibody according to claim 49, wherein the antibody is coupled to a
detectable tag.

55. The antibody according to claim 54, wherein the detectable tag is a
fluorescent protein, a fluorescent marker, a radiolabel, an enzyme, or an
affinity tag.

56. The antibody according to claim 49, wherein the antibody is coupled to a
toxin.

57. The antibody according to claim 56, wherein the toxin is an A chain
toxin, a ribosome inactivating protein, α-sarcin, gelonin, aspergillin,
restrictocin, a ribonuclease, an epipodophyllotoxin, diphtheria toxin,
Pseudomonas exotoxin, ricin, doxorubicin, daunorubicin, taxol, ethidium
bromide, mitomycin, etoposide, teniposide, vincristine, vinblastine,
colchicine, dihydroxy anthracin dione, actinomycin D, PE40, abrin, or a
glucocorticoid.

58. A pharmaceutical composition comprising a pharmaceutical carrier and
an antibody that binds to an amino acid sequence as set forth in any one
of SEQ ID NOs: 13, 14, 15, 20-59, 61-63; or a fragment of SEQ ID NO:
1.

59. A method to immunize a mammal against severe acute respiratory
syndrome comprising administering to the mammal a therapeutically
effective amount of an antibody that binds to an amino acid sequence as
set forth in any one of SEQ ID NOs: 13, 14, 15, 20-59, 61-63; or a
fragment of SEQ ID NO: 1.

60. The method of claim 59, wherein the antibody specifically binds to an
amino acid sequence as set forth in any one of SEQ ID NOs: 13, 14, 15,
20-59, 61-63; or a fragment of SEQ ID NO: 1.

61. The method of claim 59, wherein the mammal is a human.

62. A method to treat severe acute respiratory syndrome in a mammal
comprising administering to the mammal a therapeutically effective
amount of an antibody that binds to an amino acid sequence as set forth
in any one of SEQ ID NOs: 13, 14, 15, 20-59, 61-63; or a fragment of
SEQ ID NO: 1.

63. The method of claim 62, wherein the antibody specifically binds to an
amino acid sequence as set forth in any one of SEQ ID NOs: 13, 14, 15,
20-59, 61-63; or a fragment of SEQ ID NO: 1.

64. The method of claim 62, wherein the mammal is a human.

65. The method of claim 59 or 62, wherein the antibody is formulated with a
pharmaceutical carrier or diluent.

66. A method for treating or inhibiting severe acute respiratory syndrome in
a mammal comprising administering to the mammal a therapeutically
effective amount of a S polypeptide comprising an amino acid sequence
as set forth in any one of SEQ ID NOs: 13, 14, 15, 20-59, 61-63; or a
fragment of SEQ ID NO: 1.

67. A method for raising an immune response in a mammal against a SARS
coronavirus spike protein comprising administering a therapeutically
effective amount of a polypeptide comprising an amino acid sequence as
set forth in any one of SEQ ID NOs: 13, 14, 15, 20-59, 61-63; or a
fragment of SEQ ID NO: 1.

68. The method of claim 67, wherein the polypeptide comprises an amino
acid sequence as set forth in any one of SEQ ID NOs: 13, 14, 15, 25, 34,
51, 52, 56, 57, 58, 59, 61, 62, 63; or a fragment of SEQ ID NO: 1.

69. The method of claim 67, wherein the mammal is a human.

70. A method to diagnose severe acute respiratory syndrome in an animal
comprising:

(a) contacting a biological sample obtained from the animal with
an antibody that binds to an amino acid sequence as set forth in any one
of SEQ ID NOs: 13, 14, 15, 20-59, 61-63; or a fragment of SEQ ID NO:
1; and

(b) determining if the antibody binds to the biological sample.

71. The method of claim 70, wherein the animal is a mammal.

72. The method of claim 70, wherein the mammal is a human.

73. A method for making an antibody comprising: obtaining an animal that
was immunized with (a) a peptide fragment of a polypeptide having an
amino acid sequence as set forth in SEQ ID NO: 1; (b) a polypeptide
having an amino acid sequence as set forth in any one of SEQ ID NOs:
13, 14, 15, 20-59, 61-63; (c) a peptidomimetic of (a) or (b), or (d) a
conservative variant of (a) or (b); and isolating an antibody that binds to
(a).

74. A method to make an antibody comprising: obtaining an animal that was
immunized with a coupled protein having a carrier protein coupled to (a)
a peptide fragment of a polypeptide having an amino acid sequence as
set forth in SEQ ID NO: 1; (b) a polypeptide having an amino acid
sequence as set forth in any one of SEQ ID NOs: 1, 13, 14, 15, 20-55; (c)
a peptidomimetic of (a) or (b), or (d) a conservative variant of (a) or (b);
and isolating an antibody that binds to a polypeptide having an amino
acid sequence as set forth in SEQ ID NO: 1.

75. A kit comprising packaging material and an antibody or aptamer that
binds to an amino acid sequence as set forth in any one of SEQ ID NOs:
1, 13, 14, 15, 20-59, 61-63; or a fragment of SEQ ID NO: 1.

76. The kit of claim 75, wherein the antibody is formulated with a
pharmaceutical carrier or diluent.

77. The kit of claim 75, further comprising a syringe.

78. A kit comprising packaging material and a therapeutically effective
amount of a S polypeptide comprising an amino acid sequence as set
forth in any one of SEQ ID NOs: 13, 14, 15, 20-59, 61-63; or a fragment
of SEQ ID NO: 1.

79. The kit of claim 78, wherein the S polypeptide is formulated with a 
pharmaceutical carrier or diluent. 

80. The kit of claim 78, further comprising a syringe.

81. A monoclonal antibody that specifically binds to an amino acid sequence
as set forth in any one of SEQ ID NOs: 1, 13, 14, 15, 20-59, 61-63.

82. An isolated polyclonal antibody that specifically binds to an amino acid
sequence as set forth in any one of SEQ ID NOs: 1, 13, 14, 15, 20-59, 61-
63.

83. An aptamer that binds to an amino acid sequence as set forth in any one
of SEQ ID NOs: 1, 13, 14, 15, 20-59, 61-63; or a fragment of SEQ ID
NO: 1.

84. A pharmaceutical composition comprising a pharmaceutical carrier and
an aptamer that binds to an amino acid sequence as set forth in any one of
SEQ ID NOs: 1, 13, 14, 15, 20-59, 61-63; or a fragment of SEQ ID NO:
1.

SEQUENCE LISTING



<110> National Institutes of Health
      Dimitrov, Dimiter S.
      Xiao, Xiaodong

<120> Soluble Fragments of the SARS-CoV Spike Glycoprotein

<130> 1662.024WO1

<150> US 60/489,166
<151> 2003-07-21

<150> US 60/524,642
<151> 2003-11-25

<160> 65

<170> FastSEQ for Windows Version 4.0

<210> 1
<211> 1255
<212> PRT
<213> SARS coronavirus

<400> 1

Met Phe Ile Phe Leu Leu Phe Leu Thr Leu Thr Ser Gly Ser Asp Leu
1 5 10 15

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
20 25 30

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
35 40 45

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
50 55 60

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
65 70 75 80

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
85 90 95

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
100 105 110

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
115 120 125

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
130 135 140

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
145 150 155 160

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
165 170 175

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
180 185 190

Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
195 200 205

Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
210 215 220

Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
225 230 235 240

Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
245 250 255

Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
260 265 270

Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys Cys
275 280 285

Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser Asn
290 295 300

Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr
305 310 315 320

Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
325 330 335

Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
340 345 350

Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
355 360 365

Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
370 375 380

Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
385 390 395 400

Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
405 410 415

Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
420 425 430

Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu
435 440 445

Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly
450 455 460

Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp
465 470 475 480

Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val
485 490 495

Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly
500 505 510

Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn
515 520 525

Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Pro Ser Ser Lys Arg
530 535 540

Phe Gln Pro Phe Gln Gln Phe Gly Arg Asp Val Ser Asp Phe Thr Asp
545 550 555 560

Ser Val Arg Asp Pro Lys Thr Ser Glu Ile Leu Asp Ile Ser Pro Cys
565 570 575

Ala Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Ala Ser Ser
580 585 590

Glu Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Asp Val Ser Thr
595 600 605

Ala Ile His Ala Asp Gln Leu Thr Pro Ala Trp Arg Ile Tyr Ser Thr
610 615 620

Gly Asn Asn Val Phe Gln Thr Gln Ala Gly Cys Leu Ile Gly Ala Glu
625 630 635 640

His Val Asp Thr Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
645 650 655

Cys Ala Ser Tyr His Thr Val Ser Leu Leu Arg Ser Thr Ser Gln Lys
660 665 670

Ser Ile Val Ala Tyr Thr Met Ser Leu Gly Ala Asp Ser Ser Ile Ala
675 680 685

Tyr Ser Asn Asn Thr Ile Ala Ile Pro Thr Asn Phe Ser Ile Ser Ile
690 695 700

Thr Thr Glu Val Met Pro Val Ser Met Ala Lys Thr Ser Val Asp Cys
705 710 715 720

Asn Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ala Asn Leu Leu Leu
725 730 735

Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Ser Gly Ile
740 745 750

Ala Ala Glu Gln Asp Arg Asn Thr Arg Glu Val Phe Ala Gln Val Lys
755 760 765

Gln Met Tyr Lys Thr Pro Thr Leu Lys Tyr Phe Gly Gly Phe Asn Phe
770 775 780

Ser Gln Ile Leu Pro Asp Pro Leu Lys Pro Thr Lys Arg Ser Phe Ile
785 790 795 800

Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly Phe Met
805 810 815

Lys Gln Tyr Gly Glu Cys Leu Gly Asp Ile Asn Ala Arg Asp Leu Ile
820 825 830

Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu Leu Thr
835 840 845

Asp Asp Met Ile Ala Ala Tyr Thr Ala Ala Leu Val Ser Gly Thr Ala
850 855 860

Thr Ala Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile Pro Phe
865 870 875 880

Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln Asn
885 890 895

Val Leu Tyr Glu Asn Gln Lys Gln Ile Ala Asn Gln Phe Asn Lys Ala
900 905 910

Ile Ser Gln Ile Gln Glu Ser Leu Thr Thr Thr Ser Thr Ala Leu Gly
915 920 925

Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu
930 935 940

Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn
945 950 955 960

Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln Ile Asp
965 970 975

Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr Gln
980 985 990

Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu Ala Ala
995 1000 1005

Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val Asp Phe
1010 1015 1020

Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ala Ala Pro His
1025 1030 1035 1040

Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ser Gln Glu Arg Asn
1045 1050 1055

Phe Thr Thr Ala Pro Ala Ile Cys His Glu Gly Lys Ala Tyr Phe Pro
1060 1065 1070

Arg Glu Gly Val Phe Val Phe Asn Gly Thr Ser Trp Phe Ile Thr Gln
1075 1080 1085

Arg Asn Phe Phe Ser Pro Gln Ile Ile Thr Thr Asp Asn Thr Phe Val
1090 1095 1100

Ser Gly Asn Cys Asp Val Val Ile Gly Ile Ile Asn Asn Thr Val Tyr
1105 1110 1115 1120

Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys
1125 1130 1135

Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser
1140 1145 1150

Gly He Asn Ala Ser Val Val Asn He Gin Lys Glu He Asp Arg Leu 
1155 1160 1165 

5Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu He Asp Leu Gin Glu 
1170 1175 1180 

Leu Gly Lys Tyr Glu Gin Tyr He Lys Trp Pro Trp Tyr Val Trp Leu 
1185 1190 1195 120( 

Gly Phe He Ala Gly Leu He Ala He Val Met Val Thr He Leu Leu 
10 1205 1210 1215 

Cys Cys Met Thr Ser Cys Cys Ser Gys Leu Lys Gly Ala Cys Ser Cys 

1220 1225 1230 

Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro Val Leu Lys 
1235 1240 1245 

15Gly Val Lys Leu His Tyr Thr 



<210> 2 
<211> 3768 
<212> DNA

<213> SARS coronavirus 

<400> 2 

atgtttattt tcttattatt tcttactctc actagtggta gtgaccttga ccggtgcacc 60 

acttttgatg atgttcaagc tcctaattac actcaacata cttcatctat gaggggggtt 120

tactatcctg atgaaatttt tagatcagac actctttatt taactcagga tttatttctt 180 

ccattttatt ctaatgttac agggtttcat actattaatc atacgtttgg caaccctgtc 240

atacctttta aggatggtat ttattttgct gccacagaga aatcaaatgt tgtccgtggt 300

tgggtttttg gttctaccat gaacaacaag tcacagtcgg tgattattat taacaattct 360 

actaatgttg ttatacgagc atgtaacttt gaattgtgtg acaacccttt ctttgctgtt 420

tctaaaccca tgggtacaca gacacatact atgatattcg ataatgcatt taattgcact 480 

ttcgagtaca tatctgatgc cttttcgctt gatgtttcag aaaagtcagg taattttaaa 540 

cacttacgag agtttgtgtt taaaaataaa gatgggtttc tctatgttta taagggctat 600 

caacctatag atgtagttcg tgatctacct tctggtttta acactttgaa acctattttt 660 

aagttgcctc ttggtattaa cattacaaat tttagagcca ttcttacagc cttttcacct 720

gctcaagaca tttggggcac gtcagctgca gcctattttg ttggctattt aaagccaact 780 

acatttatgc tcaagtatga tgaaaatggt acaatcacag atgctgttga ttgttctcaa 840

aatccacttg ctgaactcaa atgctctgtt aagagctttg agattgacaa aggaatttac 900

cagacctcta atttcagggt tgttccctca ggagatgttg tgagattccc taatattaca 960

aacttgtgtc cttttggaga ggtttttaat gctactaaat tcccttctgt ctatgcatgg 1020

gagagaaaaa aaatttctaa ttgtgttgct gattactctg tgctctacaa ctcaacattt 1080

ttttcaacct ttaagtgcta tggcgtttct gccactaagt tgaatgatct ttgcttctcc 1140

aatgtctatg cagattcttt tgtagtcaag ggagatgatg taagacaaat agcgccagga 1200

caaactggtg ttattgctga ttataattat aaattgccag atgatttcat gggttgtgtc 1260 

cttgcttgga atactaggaa cattgatgct acttcaactg gtaattataa ttataaatat 1320

aggtatctta gacatggcaa gcttaggccc tttgagagag acatatctaa tgtgcctttc 1380 

tcccctgatg gcaaaccttg caccccacct gctcttaatt gttattggcc attaaatgat 1440

tatggttttt acaccactac tggcattggc taccaacctt acagagttgt agtactttct 1500 

tttgaacttt taaatgcacc ggccacggtt tgtggaccaa aattatccac tgaccttatt 1560

aagaaccagt gtgtcaattt taattttaat ggactcactg gtactggtgt gttaactcct 1620

tcttcaaaga gatttcaacc atttcaacaa tttggccgtg atgtttctga tttcactgat 1680 

tccgttcgag atcctaaaac atctgaaata ttagacattt caccttgcgc ttttgggggt 1740

gtaagtgtaa ttacacctgg aacaaatgct tcatctgaag ttgctgttct atatcaagat 1800

gttaactgca ctgatgtttc tacagcaatt catgcagatc aactcacacc agcttggcgc 1860

atatattcta ctggaaacaa tgtattccag actcaagcag gctgtcttat aggagctgag 1920 

catgtcgaca cttcttatga gtgcgacatt cctattggag ctggcatttg tgctagttac 1980 

catacagttt ctttattacg tagtactagc caaaaatcta ttgtggctta tactatgtct 2040

ttaggtgctg atagttcaat tgcttactct aataacacca ttgctatacc tactaacttt 2100 

tcaattagca ttactacaga agtaatgcct gtttctatgg ctaaaacctc cgtagattgt 2160 

aatatgtaca tctgcggaga ttctactgaa tgtgctaatt tgcttctcca atatggtagc 2220 

ttttgcacac aactaaatcg tgcactctca ggtattgctg ctgaacagga tcgcaacaca 2280 

cgtgaagtgt tcgctcaagt caaacaaatg tacaaaaccc caactttgaa atattttggt 2340

ggttttaatt tttcacaaat attacctgac cctctaaagc caactaagag gtcttttatt 2400 

gaggacttgc tctttaataa ggtgacactc gctgatgctg gcttcatgaa gcaatatggc 2460 

gaatgcctag gtgatattaa tgctagagat ctcatttgtg cgcagaagtt caatggactt 2520

acagtgttgc cacctctgct cactgatgat atgattgctg cctacactgc tgctctagtt 2580 

agtggtactg ccactgctgg atggacattt ggtgctggcg ctgctcttca aatacctttt 2640

gctatgcaaa tggcatatag gttcaatggc attggagtta cccaaaatgt tctctatgag 2700 

aaccaaaaac aaatcgccaa ccaatttaac aaggcgatta gtcaaattca agaatcactt 2760

acaacaacat caactgcatt gggcaagctg caagacgttg ttaaccagaa tgctcaagca 2820

ttaaacacac ttgttaaaca acttagctct aattttggtg caatttcaag tgtgctaaat 2880

gatatccttt cgcgacttga taaagtcgag gcggaggtac aaattgacag gttaattaca 2940

ggcagacttc aaagccttca aacctatgta acacaacaac taatcagggc tgctgaaatc 3000

agggcttctg ctaatcttgc tgctactaaa atgtctgagt gtgttcttgg acaatcaaaa 3060

agagttgact tttgtggaaa gggctaccac cttatgtcct tcccacaagc agccccgcat 3120 

ggtgttgtct tcctacatgt cacgtatgtg ccatcccagg agaggaactt caccacagcg 3180 

ccagcaattt gtcatgaagg caaagcatac ttccctcgtg aaggtgtttt tgtgtttaat 3240

ggcacttctt ggtttattac acagaggaac ttcttttctc cacaaataat tactacagac 3300 

aatacatttg tctcaggaaa ttgtgatgtc gttattggca tcattaacaa cacagtttat 3360

gatcctctgc aacctgagct cgactcattc aaagaagagc tggacaagta cttcaaaaat 3420

catacatcac cagatgttga tcttggcgac atttcaggca ttaacgcttc tgtcgtcaac 3480

attcaaaaag aaattgaccg cctcaatgag gtcgctaaaa atttaaatga atcactcatt 3540

gaccttcaag aattgggaaa atatgagcaa tatattaaat ggccttggta tgtttggctc 3600 

ggcttcattg ctggactaat tgccatcgtc atggttacaa tcttgctttg ttgcatgact 3660

agttgttgca gttgcctcaa gggtgcatgc tcttgtggtt cttgctgcaa gtttgatgag 3720
gatgactctg agccagttct caagggtgtc aaattacatt acacataa 3768 



<210> 3
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> A synthetic primer

<400> 3
agtcggatcc ggtaggctta tcattagag 29

<210> 4
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> A synthetic primer

<400> 4
ccatcagggg agaaaggcac 20

<210> 5
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> A synthetic primer

<400> 5
gtgcctttct cccctgatgg 20

<210> 6
<211> 19
<212> DNA
<213> Artificial Sequence

<220>
<223> A synthetic primer

<400> 6
gaagagcagc gccagcacc 19

<210> 7
<211> 19
<212> DNA
<213> Artificial Sequence

<220>
<223> A synthetic primer

<400> 7
ggtgctggcg ctgctcttc 19

<210> 8
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> A synthetic primer

<400> 8
actgtctaga gttcgtttat gtgtaatg 28

<210> 9
<211> 29
<212> DNA
<213> Artificial Sequence

<220>
<223> A synthetic primer

<400> 9
agtcggatcc gaccggtgca ccacttttg 29

<210> 10
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> A synthetic primer

<400> 10
agtcgggccc ctgttcagca gcaatacc 28

<210> 11
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> A synthetic primer

<400> 11
actgggatcc gaagtgttcg ctcaagtc 28

<210> 12
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> A synthetic primer

<400> 12
actgtctaga ttgctcatat tttccc 26

<210> 13
<211> 740
<212> PRT
<213> SARS coronavirus

<400> 13

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
1 5 10 15

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
20 25 30

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
35 40 45

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
50 55 60

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
65 70 75 80

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
85 90 95

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
100 105 110

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
115 120 125

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
130 135 140

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
145 150 155 160

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
165 170 175

Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
180 185 190

Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
195 200 205

Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
210 215 220

Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
225 230 235 240

Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
245 250 255

Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys Cys
260 265 270

Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser Asn
275 280 285

Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr
290 295 300

Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
305 310 315 320

Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
325 330 335

Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
340 345 350

Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
355 360 365

Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
370 375 380

Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
385 390 395 400

Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
405 410 415

Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu
420 425 430

Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly
435 440 445

Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp
450 455 460

Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val
465 470 475 480

Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly
485 490 495

Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn
500 505 510

Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Pro Ser Ser Lys Arg
515 520 525

Phe Gln Pro Phe Gln Gln Phe Gly Arg Asp Val Ser Asp Phe Thr Asp
530 535 540

Ser Val Arg Asp Pro Lys Thr Ser Glu Ile Leu Asp Ile Ser Pro Cys
545 550 555 560

Ala Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Ala Ser Ser
565 570 575

Glu Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Asp Val Ser Thr
580 585 590

Ala Ile His Ala Asp Gln Leu Thr Pro Ala Trp Arg Ile Tyr Ser Thr
595 600 605

Gly Asn Asn Val Phe Gln Thr Gln Ala Gly Cys Leu Ile Gly Ala Glu
610 615 620

His Val Asp Thr Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
625 630 635 640

Cys Ala Ser Tyr His Thr Val Ser Leu Leu Arg Ser Thr Ser Gln Lys
645 650 655

Ser Ile Val Ala Tyr Thr Met Ser Leu Gly Ala Asp Ser Ser Ile Ala
660 665 670

Tyr Ser Asn Asn Thr Ile Ala Ile Pro Thr Asn Phe Ser Ile Ser Ile
675 680 685

Thr Thr Glu Val Met Pro Val Ser Met Ala Lys Thr Ser Val Asp Cys
690 695 700

Asn Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ala Asn Leu Leu Leu
705 710 715 720

Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Ser Gly Ile
725 730 735

Ala Ala Glu Gln
740

<210> 14
<211> 429
<212> PRT
<213> SARS coronavirus

<400> 14

Glu Val Phe Ala Gln Val Lys Gln Met Tyr Lys Thr Pro Thr Leu Lys
1 5 10 15

Tyr Phe Gly Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Leu Lys
20 25 30

Pro Thr Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr
35 40 45

Leu Ala Asp Ala Gly Phe Met Lys Gln Tyr Gly Glu Cys Leu Gly Asp
50 55 60

Ile Asn Ala Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr
65 70 75 80

Val Leu Pro Pro Leu Leu Thr Asp Asp Met Ile Ala Ala Tyr Thr Ala
85 90 95

Ala Leu Val Ser Gly Thr Ala Thr Ala Gly Trp Thr Phe Gly Ala Gly
100 105 110

Ala Ala Leu Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn
115 120 125

Gly Ile Gly Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Gln Ile
130 135 140

Ala Asn Gln Phe Asn Lys Ala Ile Ser Gln Ile Gln Glu Ser Leu Thr
145 150 155 160

Thr Thr Ser Thr Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn
165 170 175

Ala Gln Ala Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly
180 185 190

Ala Ile Ser Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val
195 200 205

Glu Ala Glu Val Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser
210 215 220

Leu Gln Thr Tyr Val Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg
225 230 235 240

Ala Ser Ala Asn Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly
245 250 255

Gln Ser Lys Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser
260 265 270

Phe Pro Gln Ala Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr
275 280 285

Val Pro Ser Gln Glu Arg Asn Phe Thr Thr Ala Pro Ala Ile Cys His
290 295 300

Glu Gly Lys Ala Tyr Phe Pro Arg Glu Gly Val Phe Val Phe Asn Gly
305 310 315 320

Thr Ser Trp Phe Ile Thr Gln Arg Asn Phe Phe Ser Pro Gln Ile Ile
325 330 335

Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly
340 345 350

Ile Ile Asn Asn Thr Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser
355 360 365

Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp
370 375 380

Val Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile
385 390 395 400

Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu
405 410 415

Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr Glu Gln
420 425



<210> 15
<211> 1170
<212> PRT
<213> SARS coronavirus

<400> 15

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
1 5 10 15

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
20 25 30

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
35 40 45

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
50 55 60

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
65 70 75 80

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
85 90 95

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
100 105 110

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
115 120 125

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
130 135 140

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
145 150 155 160

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
165 170 175

Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
180 185 190

Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
195 200 205

Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
210 215 220

Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
225 230 235 240

Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
245 250 255

Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys Cys
260 265 270

Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser Asn
275 280 285

Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr
290 295 300

Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
305 310 315 320

Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
325 330 335

Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
340 345 350

Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
355 360 365

Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
370 375 380

Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
385 390 395 400

Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
405 410 415

Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu
420 425 430

Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly
435 440 445

Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp
450 455 460

Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val
465 470 475 480

Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly
485 490 495

Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn
500 505 510

Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Pro Ser Ser Lys Arg
515 520 525

Phe Gln Pro Phe Gln Gln Phe Gly Arg Asp Val Ser Asp Phe Thr Asp
530 535 540

Ser Val Arg Asp Pro Lys Thr Ser Glu Ile Leu Asp Ile Ser Pro Cys
545 550 555 560

Ala Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Ala Ser Ser
565 570 575

Glu Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Asp Val Ser Thr
580 585 590

Ala Ile His Ala Asp Gln Leu Thr Pro Ala Trp Arg Ile Tyr Ser Thr
595 600 605

Gly Asn Asn Val Phe Gln Thr Gln Ala Gly Cys Leu Ile Gly Ala Glu
610 615 620

His Val Asp Thr Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
625 630 635 640

Cys Ala Ser Tyr His Thr Val Ser Leu Leu Arg Ser Thr Ser Gln Lys
645 650 655

Ser Ile Val Ala Tyr Thr Met Ser Leu Gly Ala Asp Ser Ser Ile Ala
660 665 670

Tyr Ser Asn Asn Thr Ile Ala Ile Pro Thr Asn Phe Ser Ile Ser Ile
675 680 685

Thr Thr Glu Val Met Pro Val Ser Met Ala Lys Thr Ser Val Asp Cys
690 695 700

Asn Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ala Asn Leu Leu Leu
705 710 715 720

Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Ser Gly Ile
725 730 735

Ala Ala Glu Gln Asp Glu Val Phe Ala Gln Val Lys Gln Met Tyr Lys
740 745 750

Thr Pro Thr Leu Lys Tyr Phe Gly Gly Phe Asn Phe Ser Gln Ile Leu
755 760 765

Pro Asp Pro Leu Lys Pro Thr Lys Arg Ser Phe Ile Glu Asp Leu Leu
770 775 780

Phe Asn Lys Val Thr Leu Ala Asp Ala Gly Phe Met Lys Gln Tyr Gly
785 790 795 800

Glu Cys Leu Gly Asp Ile Asn Ala Arg Asp Leu Ile Cys Ala Gln Lys
805 810 815

Phe Asn Gly Leu Thr Val Leu Pro Pro Leu Leu Thr Asp Asp Met Ile
820 825 830

Ala Ala Tyr Thr Ala Ala Leu Val Ser Gly Thr Ala Thr Ala Gly Trp
835 840 845

Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile Pro Phe Ala Met Gln Met
850 855 860

Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln Asn Val Leu Tyr Glu
865 870 875 880

Asn Gln Lys Gln Ile Ala Asn Gln Phe Asn Lys Ala Ile Ser Gln Ile
885 890 895

Gln Glu Ser Leu Thr Thr Thr Ser Thr Ala Leu Gly Lys Leu Gln Asp
900 905 910

Val Val Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu Val Lys Gln Leu
915 920 925

Ser Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn Asp Ile Leu Ser
930 935 940

Arg Leu Asp Lys Val Glu Ala Glu Val Gln Ile Asp Arg Leu Ile Thr
945 950 955 960

Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr Gln Gln Leu Ile Arg
965 970 975

Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu Ala Ala Thr Lys Met Ser
980 985 990

Glu Cys Val Leu Gly Gln Ser Lys Arg Val Asp Phe Cys Gly Lys Gly
995 1000 1005

Tyr His Leu Met Ser Phe Pro Gln Ala Ala Pro His Gly Val Val Phe
1010 1015 1020

Leu His Val Thr Tyr Val Pro Ser Gln Glu Arg Asn Phe Thr Thr Ala
1025 1030 1035 1040

Pro Ala Ile Cys His Glu Gly Lys Ala Tyr Phe Pro Arg Glu Gly Val
1045 1050 1055

Phe Val Phe Asn Gly Thr Ser Trp Phe Ile Thr Gln Arg Asn Phe Phe
1060 1065 1070

Ser Pro Gln Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys
1075 1080 1085

Asp Val Val Ile Gly Ile Ile Asn Asn Thr Val Tyr Asp Pro Leu Gln
1090 1095 1100

Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn
1105 1110 1115 1120

His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala
1125 1130 1135

Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala
1140 1145 1150

Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr
1155 1160 1165

Glu Gln
1170

<210> 16
<211> 21
<212> PRT
<213> Artificial Sequence

<220>
<223> A synthetic k chain leader sequence

<400> 16

Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro
1 5 10 15

Gly Ser Thr Gly Asp
20

<210> 17
<211> 10
<212> PRT
<213> Artificial Sequence

<220>
<223> A synthetic myc epitope

<400> 17

Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu
1 5 10

<210> 18
<211> 6
<212> PRT
<213> Artificial Sequence

<220>
<223> A synthetic histidine tag

<400> 18

His His His His His His
1 5

<210> 19
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> A synthetic primer

<400> 19
ctagctcgag caacagcatc tgtg 24

<210> 20
<211> 100
<212> PRT
<213> SARS coronavirus

<400> 20

Met Phe Ile Phe Leu Leu Phe Leu Thr Leu Thr Ser Gly Ser Asp Leu
1 5 10 15

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
20 25 30

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
35 40 45

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
50 55 60

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
65 70 75 80

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
85 90 95

Val Val Arg Gly
100

<210> 21
<211> 100
<212> PRT
<213> SARS coronavirus

<400> 21

Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln Ser Val Ile Ile
1 5 10 15

Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys Asn Phe Glu Leu
20 25 30

Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met Gly Thr Gln Thr
35 40 45

His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr Phe Glu Tyr Ile
50 55 60

Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser Gly Asn Phe Lys
65 70 75 80

His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly Phe Leu Tyr Val
85 90 95

Tyr Lys Gly Tyr
100

<210> 22
<211> 100
<212> PRT
<213> SARS coronavirus

<400> 22

Gln Pro Ile Asp Val Val Arg Asp Leu Pro Ser Gly Phe Asn Thr Leu
1 5 10 15

Lys Pro Ile Phe Lys Leu Pro Leu Gly Ile Asn Ile Thr Asn Phe Arg
20 25 30

Ala Ile Leu Thr Ala Phe Ser Pro Ala Gln Asp Ile Trp Gly Thr Ser
35 40 45

Ala Ala Ala Tyr Phe Val Gly Tyr Leu Lys Pro Thr Thr Phe Met Leu
50 55 60

Lys Tyr Asp Glu Asn Gly Thr Ile Thr Asp Ala Val Asp Cys Ser Gln
65 70 75 80

Asn Pro Leu Ala Glu Leu Lys Cys Ser Val Lys Ser Phe Glu Ile Asp
85 90 95

Lys Gly Ile Tyr
100

<210> 23
<211> 100
<212> PRT
<213> SARS coronavirus

<400> 23

Gln Thr Ser Asn Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe
1 5 10 15

Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr
20 25 30

Lys Phe Pro Ser Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys
35 40 45

Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe
50 55 60

Lys Cys Tyr Gly Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser
65 70 75 80

Asn Val Tyr Ala Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln
85 90 95

Ile Ala Pro Gly
100

<210> 24
<211> 100
<212> PRT
<213> SARS coronavirus

<400> 24

Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
1 5 10 15

Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
20 25 30

Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu
35 40 45

Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly
50 55 60

Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp
65 70 75 80

Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val
85 90 95

Val Val Leu Ser
100

<210> 25
<211> 100
<212> PRT
<213> SARS coronavirus

<400> 25

Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly Pro Lys Leu Ser
1 5 10 15

Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn Phe Asn Gly Leu
20 25 30

Thr Gly Thr Gly Val Leu Thr Pro Ser Ser Lys Arg Phe Gln Pro Phe
35 40 45

Gln Gln Phe Gly Arg Asp Val Ser Asp Phe Thr Asp Ser Val Arg Asp
50 55 60

Pro Lys Thr Ser Glu Ile Leu Asp Ile Ser Pro Cys Ala Phe Gly Gly
65 70 75 80

Val Ser Val Ile Thr Pro Gly Thr Asn Ala Ser Ser Glu Val Ala Val
85 90 95

Leu Tyr Gln Asp
100

<210> 26
<211> 100
<212> PRT
<213> SARS coronavirus

<400> 26

Val Asn Cys Thr Asp Val Ser Thr Ala Ile His Ala Asp Gln Leu Thr
1 5 10 15

Pro Ala Trp Arg Ile Tyr Ser Thr Gly Asn Asn Val Phe Gln Thr Gln
20 25 30

Ala Gly Cys Leu Ile Gly Ala Glu His Val Asp Thr Ser Tyr Glu Cys
35 40 45

Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala Ser Tyr His Thr Val Ser
50 55 60

Leu Leu Arg Ser Thr Ser Gln Lys Ser Ile Val Ala Tyr Thr Met Ser
65 70 75 80

Leu Gly Ala Asp Ser Ser Ile Ala Tyr Ser Asn Asn Thr Ile Ala Ile
85 90 95

Pro Thr Asn Phe
100

<210> 27
<211> 100
<212> PRT
<213> SARS coronavirus

<400> 27

Ser Ile Ser Ile Thr Thr Glu Val Met Pro Val Ser Met Ala Lys Thr
1 5 10 15

Ser Val Asp Cys Asn Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ala
20 25 30

Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala
35 40 45

Leu Ser Gly Ile Ala Ala Glu Gln Asp Arg Asn Thr Arg Glu Val Phe
50 55 60

Ala Gln Val Lys Gln Met Tyr Lys Thr Pro Thr Leu Lys Tyr Phe Gly
65 70 75 80

Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Leu Lys Pro Thr Lys
85 90 95

Arg Ser Phe Ile
100

<210> 28
<211> 100
<212> PRT
<213> SARS coronavirus

<400> 28

Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly Phe Met
1 5 10 15

Lys Gln Tyr Gly Glu Cys Leu Gly Asp Ile Asn Ala Arg Asp Leu Ile
20 25 30

Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu Leu Thr
35 40 45

Asp Asp Met Ile Ala Ala Tyr Thr Ala Ala Leu Val Ser Gly Thr Ala
50 55 60

Thr Ala Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile Pro Phe
65 70 75 80

Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln Asn
85 90 95

Val Leu Tyr Glu
100



<210> 29
<211> 100
<212> PRT
<213> SARS coronavirus

<400> 29

Asn Gln Lys Gln Ile Ala Asn Gln Phe Asn Lys Ala Ile Ser Gln Ile
1 5 10 15

Gln Glu Ser Leu Thr Thr Thr Ser Thr Ala Leu Gly Lys Leu Gln Asp
20 25 30

Val Val Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu Val Lys Gln Leu
35 40 45

Ser Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn Asp Ile Leu Ser
50 55 60

Arg Leu Asp Lys Val Glu Ala Glu Val Gln Ile Asp Arg Leu Ile Thr
65 70 75 80

Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr Gln Gln Leu Ile Arg
85 90 95

Ala Ala Glu Ile
100



<210> 30
<211> 100
<212> PRT
<213> SARS coronavirus

<400> 30

Arg Ala Ser Ala Asn Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu
1 5 10 15

Gly Gln Ser Lys Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met
20 25 30

Ser Phe Pro Gln Ala Ala Pro His Gly Val Val Phe Leu His Val Thr
35 40 45

Tyr Val Pro Ser Gln Glu Arg Asn Phe Thr Thr Ala Pro Ala Ile Cys
50 55 60

His Glu Gly Lys Ala Tyr Phe Pro Arg Glu Gly Val Phe Val Phe Asn
65 70 75 80

Gly Thr Ser Trp Phe Ile Thr Gln Arg Asn Phe Phe Ser Pro Gln Ile
85 90 95

Ile Thr Thr Asp
100



<210> 31
<211> 90
<212> PRT
<213> SARS coronavirus

<400> 31

Asn Thr Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile Ile Asn
1 5 10 15

Asn Thr Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu
20 25 30

Glu Leu Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu
35 40 45

Gly Asp Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu
50 55 60

Ile Asp Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile
65 70 75 80

Asp Leu Gln Glu Leu Gly Lys Tyr Glu Gln
85 90



<210> 32
<211> 200
<212> PRT
<213> SARS coronavirus

<400> 32

Met Phe Ile Phe Leu Leu Phe Leu Thr Leu Thr Ser Gly Ser Asp Leu
1 5 10 15

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
20 25 30

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
35 40 45

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
50 55 60

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
65 70 75 80

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
85 90 95

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
100 105 110

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
115 120 125

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
130 135 140

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
145 150 155 160

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
165 170 175

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
180 185 190

Phe Leu Tyr Val Tyr Lys Gly Tyr
195 200



<210> 33
<211> 200
<212> PRT
<213> SARS coronavirus

<400> 33

Gln Pro Ile Asp Val Val Arg Asp Leu Pro Ser Gly Phe Asn Thr Leu
1 5 10 15

Lys Pro Ile Phe Lys Leu Pro Leu Gly Ile Asn Ile Thr Asn Phe Arg
20 25 30

Ala Ile Leu Thr Ala Phe Ser Pro Ala Gln Asp Ile Trp Gly Thr Ser
35 40 45

Ala Ala Ala Tyr Phe Val Gly Tyr Leu Lys Pro Thr Thr Phe Met Leu
50 55 60

Lys Tyr Asp Glu Asn Gly Thr Ile Thr Asp Ala Val Asp Cys Ser Gln
65 70 75 80

Asn Pro Leu Ala Glu Leu Lys Cys Ser Val Lys Ser Phe Glu Ile Asp
85 90 95

Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val Val Pro Ser Gly Asp
100 105 110

Val Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val
115 120 125

Phe Asn Ala Thr Lys Phe Pro Ser Val Tyr Ala Trp Glu Arg Lys Lys
130 135 140

Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Thr Phe
145 150 155 160

Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Ala Thr Lys Leu Asn Asp
165 170 175



WO 2005/010034 PCT/US2004/023345

Leu Cys Phe Ser Asn Val Tyr Ala Asp Ser Phe Val Val Lys Gly Asp
180 185 190

Asp Val Arg Gln Ile Ala Pro Gly
195 200

<210> 34
<211> 200
<212> PRT
<213> SARS coronavirus

<400> 34

Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
1 5 10 15

Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
20 25 30

Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu
35 40 45

Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly
50 55 60

Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp
65 70 75 80

Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val
85 90 95

Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly
100 105 110

Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn
115 120 125

Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Pro Ser Ser Lys Arg
130 135 140

Phe Gln Pro Phe Gln Gln Phe Gly Arg Asp Val Ser Asp Phe Thr Asp
145 150 155 160

Ser Val Arg Asp Pro Lys Thr Ser Glu Ile Leu Asp Ile Ser Pro Cys
165 170 175

Ala Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Ala Ser Ser
180 185 190

Glu Val Ala Val Leu Tyr Gln Asp
195 200



<210> 35 
<211> 200
<212> PRT

<213> SARS coronavirus

<400> 35

Val Asn Cys Thr Asp Val Ser Thr Ala Ile His Ala Asp Gln Leu Thr
1 5 10 15

Pro Ala Trp Arg Ile Tyr Ser Thr Gly Asn Asn Val Phe Gln Thr Gln
20 25 30

Ala Gly Cys Leu Ile Gly Ala Glu His Val Asp Thr Ser Tyr Glu Cys
35 40 45

Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala Ser Tyr His Thr Val Ser
50 55 60

Leu Leu Arg Ser Thr Ser Gln Lys Ser Ile Val Ala Tyr Thr Met Ser
65 70 75 80

Leu Gly Ala Asp Ser Ser Ile Ala Tyr Ser Asn Asn Thr Ile Ala Ile
85 90 95

Pro Thr Asn Phe Ser Ile Ser Ile Thr Thr Glu Val Met Pro Val Ser
100 105 110

Met Ala Lys Thr Ser Val Asp Cys Asn Met Tyr Ile Cys Gly Asp Ser
115 120 125

Thr Glu Cys Ala Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln
130 135 140

Leu Asn Arg Ala Leu Ser Gly Ile Ala Ala Glu Gln Asp Arg Asn Thr
145 150 155 160

Arg Glu Val Phe Ala Gln Val Lys Gln Met Tyr Lys Thr Pro Thr Leu
165 170 175

Lys Tyr Phe Gly Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Leu
180 185 190

Lys Pro Thr Lys Arg Ser Phe Ile
195 200



<210> 36 
<211> 200
<212> PRT

<213> SARS coronavirus

<400> 36

Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly Phe Met
1 5 10 15

Lys Gln Tyr Gly Glu Cys Leu Gly Asp Ile Asn Ala Arg Asp Leu Ile
20 25 30

Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu Leu Thr
35 40 45

Asp Asp Met Ile Ala Ala Tyr Thr Ala Ala Leu Val Ser Gly Thr Ala
50 55 60

Thr Ala Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile Pro Phe
65 70 75 80

Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln Asn
85 90 95

Val Leu Tyr Glu Asn Gln Lys Gln Ile Ala Asn Gln Phe Asn Lys Ala
100 105 110

Ile Ser Gln Ile Gln Glu Ser Leu Thr Thr Thr Ser Thr Ala Leu Gly
115 120 125

Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu
130 135 140

Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn
145 150 155 160

Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln Ile Asp
165 170 175

Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr Gln
180 185 190

Gln Leu Ile Arg Ala Ala Glu Ile
195 200



<210> 37
<211> 190
<212> PRT

<213> SARS coronavirus

<400> 37

Arg Ala Ser Ala Asn Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu
1 5 10 15

Gly Gln Ser Lys Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met
20 25 30

Ser Phe Pro Gln Ala Ala Pro His Gly Val Val Phe Leu His Val Thr
35 40 45

Tyr Val Pro Ser Gln Glu Arg Asn Phe Thr Thr Ala Pro Ala Ile Cys
50 55 60

His Glu Gly Lys Ala Tyr Phe Pro Arg Glu Gly Val Phe Val Phe Asn
65 70 75 80

Gly Thr Ser Trp Phe Ile Thr Gln Arg Asn Phe Phe Ser Pro Gln Ile
85 90 95

Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp Val Val Ile
100 105 110

Gly Ile Ile Asn Asn Thr Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp
115 120 125

Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn His Thr Ser Pro
130 135 140

Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala Ser Val Val Asn
145 150 155 160

Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala Lys Asn Leu Asn
165 170 175

Glu Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr Glu Gln
180 185 190



<210> 38
<211> 400
<212> PRT

<213> SARS coronavirus

<400> 38

Met Phe Ile Phe Leu Leu Phe Leu Thr Leu Thr Ser Gly Ser Asp Leu
1 5 10 15

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
20 25 30

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
35 40 45

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
50 55 60

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
65 70 75 80

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
85 90 95

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
100 105 110

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
115 120 125

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
130 135 140

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
145 150 155 160

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
165 170 175

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
180 185 190

Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
195 200 205

Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
210 215 220

Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
225 230 235 240

Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
245 250 255

Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
260 265 270

Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys Cys
275 280 285

Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser Asn
290 295 300

Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr
305 310 315 320

Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
325 330 335

Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
340 345 350

Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
355 360 365

Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
370 375 380

Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
385 390 395 400

<210> 39 
<211> 600 
<212> PRT 

<213> SARS coronavirus 

<400> 39

Met Phe Ile Phe Leu Leu Phe Leu Thr Leu Thr Ser Gly Ser Asp Leu
1 5 10 15

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
20 25 30

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
35 40 45

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
50 55 60

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
65 70 75 80

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
85 90 95

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
100 105 110

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
115 120 125

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
130 135 140

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
145 150 155 160

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
165 170 175

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
180 185 190

Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
195 200 205

Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
210 215 220

Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
225 230 235 240

Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
245 250 255

Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
260 265 270

Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys Cys
275 280 285

Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser Asn
290 295 300

Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr
305 310 315 320

Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
325 330 335

Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
340 345 350

Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
355 360 365

Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
370 375 380

Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
385 390 395 400

Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
405 410 415

Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
420 425 430

Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu
435 440 445

Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly
450 455 460

Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp
465 470 475 480

Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val
485 490 495

Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly
500 505 510

Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn
515 520 525

Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Pro Ser Ser Lys Arg
530 535 540

Phe Gln Pro Phe Gln Gln Phe Gly Arg Asp Val Ser Asp Phe Thr Asp
545 550 555 560

Ser Val Arg Asp Pro Lys Thr Ser Glu Ile Leu Asp Ile Ser Pro Cys
565 570 575

Ala Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Ala Ser Ser
580 585 590

Glu Val Ala Val Leu Tyr Gln Asp
595 600

<210> 40 
<211> 800 
<212> PRT 

<213> SARS coronavirus 

<400> 40

Met Phe Ile Phe Leu Leu Phe Leu Thr Leu Thr Ser Gly Ser Asp Leu
1 5 10 15

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
20 25 30

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
35 40 45

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
50 55 60

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
65 70 75 80

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
85 90 95

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
100 105 110

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
115 120 125

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
130 135 140

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
145 150 155 160

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
165 170 175

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
180 185 190

Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
195 200 205

Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
210 215 220

Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
225 230 235 240

Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
245 250 255

Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
260 265 270

Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys Cys
275 280 285

Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser Asn
290 295 300

Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr
305 310 315 320

Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
325 330 335

Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
340 345 350

Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
355 360 365

Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
370 375 380

Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
385 390 395 400

Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
405 410 415

Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
420 425 430

Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu
435 440 445

Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly
450 455 460

Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp
465 470 475 480

Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val
485 490 495

Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly
500 505 510

Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn
515 520 525

Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Pro Ser Ser Lys Arg
530 535 540

Phe Gln Pro Phe Gln Gln Phe Gly Arg Asp Val Ser Asp Phe Thr Asp
545 550 555 560

Ser Val Arg Asp Pro Lys Thr Ser Glu Ile Leu Asp Ile Ser Pro Cys
565 570 575

Ala Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Ala Ser Ser
580 585 590

Glu Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Asp Val Ser Thr
595 600 605

Ala Ile His Ala Asp Gln Leu Thr Pro Ala Trp Arg Ile Tyr Ser Thr
610 615 620

Gly Asn Asn Val Phe Gln Thr Gln Ala Gly Cys Leu Ile Gly Ala Glu
625 630 635 640

His Val Asp Thr Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
645 650 655

Cys Ala Ser Tyr His Thr Val Ser Leu Leu Arg Ser Thr Ser Gln Lys
660 665 670

Ser Ile Val Ala Tyr Thr Met Ser Leu Gly Ala Asp Ser Ser Ile Ala
675 680 685

Tyr Ser Asn Asn Thr Ile Ala Ile Pro Thr Asn Phe Ser Ile Ser Ile
690 695 700

Thr Thr Glu Val Met Pro Val Ser Met Ala Lys Thr Ser Val Asp Cys
705 710 715 720

Asn Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ala Asn Leu Leu Leu
725 730 735

Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Ser Gly Ile
740 745 750

Ala Ala Glu Gln Asp Arg Asn Thr Arg Glu Val Phe Ala Gln Val Lys
755 760 765

Gln Met Tyr Lys Thr Pro Thr Leu Lys Tyr Phe Gly Gly Phe Asn Phe
770 775 780

Ser Gln Ile Leu Pro Asp Pro Leu Lys Pro Thr Lys Arg Ser Phe Ile
785 790 795 800



<210> 41 

<211> 1000 

<212> PRT

<213> SARS coronavirus

<400> 41

Met Phe Ile Phe Leu Leu Phe Leu Thr Leu Thr Ser Gly Ser Asp Leu
1 5 10 15

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
20 25 30

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
35 40 45

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
50 55 60

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
65 70 75 80

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
85 90 95

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
100 105 110

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
115 120 125

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
130 135 140

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
145 150 155 160

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
165 170 175

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
180 185 190

Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
195 200 205

Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
210 215 220

Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
225 230 235 240

Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
245 250 255

Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
260 265 270

Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys Cys
275 280 285

Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser Asn
290 295 300

Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr
305 310 315 320

Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
325 330 335

Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
340 345 350

Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
355 360 365

Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
370 375 380

Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
385 390 395 400

Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
405 410 415

Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
420 425 430

Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu
435 440 445

Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly
450 455 460

Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp
465 470 475 480

Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val
485 490 495

Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly
500 505 510

Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn
515 520 525

Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Pro Ser Ser Lys Arg
530 535 540

Phe Gln Pro Phe Gln Gln Phe Gly Arg Asp Val Ser Asp Phe Thr Asp
545 550 555 560

Ser Val Arg Asp Pro Lys Thr Ser Glu Ile Leu Asp Ile Ser Pro Cys
565 570 575

Ala Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Ala Ser Ser
580 585 590

Glu Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Asp Val Ser Thr
595 600 605

Ala Ile His Ala Asp Gln Leu Thr Pro Ala Trp Arg Ile Tyr Ser Thr
610 615 620

Gly Asn Asn Val Phe Gln Thr Gln Ala Gly Cys Leu Ile Gly Ala Glu
625 630 635 640

His Val Asp Thr Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
645 650 655

Cys Ala Ser Tyr His Thr Val Ser Leu Leu Arg Ser Thr Ser Gln Lys
660 665 670

Ser Ile Val Ala Tyr Thr Met Ser Leu Gly Ala Asp Ser Ser Ile Ala
675 680 685

Tyr Ser Asn Asn Thr Ile Ala Ile Pro Thr Asn Phe Ser Ile Ser Ile
690 695 700

Thr Thr Glu Val Met Pro Val Ser Met Ala Lys Thr Ser Val Asp Cys
705 710 715 720

Asn Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ala Asn Leu Leu Leu
725 730 735

Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Ser Gly Ile
740 745 750

Ala Ala Glu Gln Asp Arg Asn Thr Arg Glu Val Phe Ala Gln Val Lys
755 760 765

Gln Met Tyr Lys Thr Pro Thr Leu Lys Tyr Phe Gly Gly Phe Asn Phe
770 775 780

Ser Gln Ile Leu Pro Asp Pro Leu Lys Pro Thr Lys Arg Ser Phe Ile
785 790 795 800

Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly Phe Met
805 810 815

Lys Gln Tyr Gly Glu Cys Leu Gly Asp Ile Asn Ala Arg Asp Leu Ile
820 825 830

Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu Leu Thr
835 840 845

Asp Asp Met Ile Ala Ala Tyr Thr Ala Ala Leu Val Ser Gly Thr Ala
850 855 860

Thr Ala Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile Pro Phe
865 870 875 880

Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln Asn
885 890 895

Val Leu Tyr Glu Asn Gln Lys Gln Ile Ala Asn Gln Phe Asn Lys Ala
900 905 910

Ile Ser Gln Ile Gln Glu Ser Leu Thr Thr Thr Ser Thr Ala Leu Gly
915 920 925

Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu
930 935 940

Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn
945 950 955 960

Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln Ile Asp
965 970 975

Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr Gln
980 985 990

Gln Leu Ile Arg Ala Ala Glu Ile
995 1000

<210> 42 
<211> 1190 

<212> PRT 

<213> SARS coronavirus 

<400> 42

Met Phe Ile Phe Leu Leu Phe Leu Thr Leu Thr Ser Gly Ser Asp Leu
1 5 10 15

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
20 25 30

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
35 40 45

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
50 55 60

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
65 70 75 80

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
85 90 95

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
100 105 110

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
115 120 125

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
130 135 140

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
145 150 155 160

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
165 170 175

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
180 185 190

Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
195 200 205

Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
210 215 220

Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
225 230 235 240

Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
245 250 255

Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
260 265 270

Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys Cys
275 280 285

Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser Asn
290 295 300

Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr
305 310 315 320

Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
325 330 335

Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
340 345 350

Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
355 360 365

Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
370 375 380

Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
385 390 395 400

Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
405 410 415

Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
420 425 430

Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu
435 440 445

Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly
450 455 460

Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp
465 470 475 480

Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val
485 490 495

Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly
500 505 510

Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn
515 520 525

Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Pro Ser Ser Lys Arg
530 535 540

Phe Gln Pro Phe Gln Gln Phe Gly Arg Asp Val Ser Asp Phe Thr Asp
545 550 555 560

Ser Val Arg Asp Pro Lys Thr Ser Glu Ile Leu Asp Ile Ser Pro Cys
565 570 575

Ala Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Ala Ser Ser
580 585 590

Glu Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Asp Val Ser Thr
595 600 605

Ala Ile His Ala Asp Gln Leu Thr Pro Ala Trp Arg Ile Tyr Ser Thr
610 615 620

Gly Asn Asn Val Phe Gln Thr Gln Ala Gly Cys Leu Ile Gly Ala Glu
625 630 635 640

His Val Asp Thr Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
645 650 655

Cys Ala Ser Tyr His Thr Val Ser Leu Leu Arg Ser Thr Ser Gln Lys
660 665 670

Ser Ile Val Ala Tyr Thr Met Ser Leu Gly Ala Asp Ser Ser Ile Ala
675 680 685

Tyr Ser Asn Asn Thr Ile Ala Ile Pro Thr Asn Phe Ser Ile Ser Ile
690 695 700

Thr Thr Glu Val Met Pro Val Ser Met Ala Lys Thr Ser Val Asp Cys
705 710 715 720

Asn Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ala Asn Leu Leu Leu
725 730 735

Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Ser Gly Ile
740 745 750

Ala Ala Glu Gln Asp Arg Asn Thr Arg Glu Val Phe Ala Gln Val Lys
755 760 765

Gln Met Tyr Lys Thr Pro Thr Leu Lys Tyr Phe Gly Gly Phe Asn Phe
770 775 780

Ser Gln Ile Leu Pro Asp Pro Leu Lys Pro Thr Lys Arg Ser Phe Ile
785 790 795 800

Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly Phe Met
805 810 815

Lys Gln Tyr Gly Glu Cys Leu Gly Asp Ile Asn Ala Arg Asp Leu Ile
820 825 830

Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu Leu Thr
835 840 845

Asp Asp Met Ile Ala Ala Tyr Thr Ala Ala Leu Val Ser Gly Thr Ala
850 855 860

Thr Ala Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile Pro Phe
865 870 875 880

Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln Asn
885 890 895

Val Leu Tyr Glu Asn Gln Lys Gln Ile Ala Asn Gln Phe Asn Lys Ala
900 905 910

Ile Ser Gln Ile Gln Glu Ser Leu Thr Thr Thr Ser Thr Ala Leu Gly
915 920 925

Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu
930 935 940

Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn
945 950 955 960

Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln Ile Asp
965 970 975

Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr Gln
980 985 990

Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu Ala Ala
995 1000 1005

Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val Asp Phe
1010 1015 1020

Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ala Ala Pro His
1025 1030 1035 1040

Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ser Gln Glu Arg Asn
1045 1050 1055

Phe Thr Thr Ala Pro Ala Ile Cys His Glu Gly Lys Ala Tyr Phe Pro
1060 1065 1070

Arg Glu Gly Val Phe Val Phe Asn Gly Thr Ser Trp Phe Ile Thr Gln
1075 1080 1085

Arg Asn Phe Phe Ser Pro Gln Ile Ile Thr Thr Asp Asn Thr Phe Val
1090 1095 1100

Ser Gly Asn Cys Asp Val Val Ile Gly Ile Ile Asn Asn Thr Val Tyr
1105 1110 1115 1120

Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys
1125 1130 1135

Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser
1140 1145 1150

Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu
1155 1160 1165

Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu
1170 1175 1180

Leu Gly Lys Tyr Glu Gln
1185 1190

<210> 43 

<211> 84 
<212> PRT 

<213> SARS coronavirus 

<400> 43

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
1 5 10 15

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
20 25 30

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
35 40 45

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
50 55 60

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
65 70 75 80

Val Val Arg Gly



<210> 44
<211> 184
<212> PRT

<213> SARS coronavirus

<400> 44

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
1 5 10 15

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
20 25 30

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
35 40 45

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
50 55 60

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
65 70 75 80

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
85 90 95

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
100 105 110

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
115 120 125

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
130 135 140

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
145 150 155 160

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
165 170 175

Phe Leu Tyr Val Tyr Lys Gly Tyr
180



<210> 45 
<211> 384
<212> PRT

<213> SARS coronavirus

<400> 45

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
1 5 10 15

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
20 25 30

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
35 40 45

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
50 55 60

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
65 70 75 80

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
85 90 95

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
100 105 110

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
115 120 125

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
130 135 140

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
145 150 155 160

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
165 170 175

Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
180 185 190

Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
195 200 205

Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
210 215 220

Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
225 230 235 240

Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
245 250 255

Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys Cys
260 265 270

Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser Asn
275 280 285

Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr
290 295 300

Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
305 310 315 320

Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
325 330 335

Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
340 345 350

Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
355 360 365

Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
370 375 380



<210> 46 
<211> 584 
<212> PRT

<213> SARS coronavirus

<400> 46

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
1 5 10 15

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
20 25 30

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
35 40 45

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
50 55 60

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
65 70 75 80

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
85 90 95

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
100 105 110

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
115 120 125

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
130 135 140

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
145 150 155 160

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
165 170 175

Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
180 185 190

Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
195 200 205

Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
210 215 220

Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
225 230 235 240

Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
245 250 255

Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys Cys
260 265 270

Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser Asn
275 280 285

Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr
290 295 300

Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
305 310 315 320

Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
325 330 335

Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
340 345 350

Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
355 360 365

Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
370 375 380

Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
385 390 395 400

Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
405 410 415

Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu
420 425 430

Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly
435 440 445

Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp
450 455 460

Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val
465 470 475 480

Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly
485 490 495

Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn
500 505 510

Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Pro Ser Ser Lys Arg
515 520 525

Phe Gln Pro Phe Gln Gln Phe Gly Arg Asp Val Ser Asp Phe Thr Asp
530 535 540

Ser Val Arg Asp Pro Lys Thr Ser Glu Ile Leu Asp Ile Ser Pro Cys
545 550 555 560

Ala Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Ala Ser Ser
565 570 575

Glu Val Ala Val Leu Tyr Gln Asp
580

<210> 47 
<211> 784 
<212> PRT 

<213> SARS coronavirus 


<400> 47 

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
1 5 10 15

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
20 25 30

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
35 40 45

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
50 55 60

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
65 70 75 80

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
85 90 95

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
100 105 110

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
115 120 125

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
130 135 140

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
145 150 155 160

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
165 170 175

Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
180 185 190

Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
195 200 205

Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
210 215 220

Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
225 230 235 240

Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
245 250 255

Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys Cys
260 265 270

Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser Asn
275 280 285

Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr
290 295 300

Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
305 310 315 320

Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
325 330 335

Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
340 345 350

Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
355 360 365

Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
370 375 380

Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
385 390 395 400

Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
405 410 415

Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu
420 425 430

Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly
435 440 445

Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp
450 455 460

Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val
465 470 475 480

Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly
485 490 495

Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn
500 505 510

Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Pro Ser Ser Lys Arg
515 520 525

Phe Gln Pro Phe Gln Gln Phe Gly Arg Asp Val Ser Asp Phe Thr Asp
530 535 540

Ser Val Arg Asp Pro Lys Thr Ser Glu Ile Leu Asp Ile Ser Pro Cys
545 550 555 560

Ala Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Ala Ser Ser
565 570 575

Glu Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Asp Val Ser Thr
580 585 590

Ala Ile His Ala Asp Gln Leu Thr Pro Ala Trp Arg Ile Tyr Ser Thr
595 600 605

Gly Asn Asn Val Phe Gln Thr Gln Ala Gly Cys Leu Ile Gly Ala Glu
610 615 620

His Val Asp Thr Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
625 630 635 640

Cys Ala Ser Tyr His Thr Val Ser Leu Leu Arg Ser Thr Ser Gln Lys
645 650 655

Ser Ile Val Ala Tyr Thr Met Ser Leu Gly Ala Asp Ser Ser Ile Ala
660 665 670

Tyr Ser Asn Asn Thr Ile Ala Ile Pro Thr Asn Phe Ser Ile Ser Ile
675 680 685

Thr Thr Glu Val Met Pro Val Ser Met Ala Lys Thr Ser Val Asp Cys
690 695 700

Asn Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ala Asn Leu Leu Leu
705 710 715 720

Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Ser Gly Ile
725 730 735

Ala Ala Glu Gln Asp Arg Asn Thr Arg Glu Val Phe Ala Gln Val Lys
740 745 750

Gln Met Tyr Lys Thr Pro Thr Leu Lys Tyr Phe Gly Gly Phe Asn Phe
755 760 765

Ser Gln Ile Leu Pro Asp Pro Leu Lys Pro Thr Lys Arg Ser Phe Ile
770 775 780

<210> 48 
<211> 984 
<212> PRT 

<213> SARS coronavirus 


<400> 48 

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
1 5 10 15

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
20 25 30

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
35 40 45

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
50 55 60

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
65 70 75 80

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
85 90 95

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
100 105 110

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
115 120 125

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
130 135 140

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
145 150 155 160

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
165 170 175

Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
180 185 190

Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
195 200 205

Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
210 215 220

Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
225 230 235 240



WO 2005/010034



PCT/US2004/023345 



Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
245 250 255

Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys Cys
260 265 270

Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser Asn
275 280 285

Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr
290 295 300

Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
305 310 315 320

Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
325 330 335

Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
340 345 350

Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
355 360 365

Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
370 375 380

Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
385 390 395 400

Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
405 410 415

Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu
420 425 430

Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly
435 440 445

Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp
450 455 460

Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val
465 470 475 480

Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly
485 490 495

Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn
500 505 510

Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Pro Ser Ser Lys Arg
515 520 525

Phe Gln Pro Phe Gln Gln Phe Gly Arg Asp Val Ser Asp Phe Thr Asp
530 535 540

Ser Val Arg Asp Pro Lys Thr Ser Glu Ile Leu Asp Ile Ser Pro Cys
545 550 555 560

Ala Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Ala Ser Ser
565 570 575

Glu Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Asp Val Ser Thr
580 585 590

Ala Ile His Ala Asp Gln Leu Thr Pro Ala Trp Arg Ile Tyr Ser Thr
595 600 605

Gly Asn Asn Val Phe Gln Thr Gln Ala Gly Cys Leu Ile Gly Ala Glu
610 615 620

His Val Asp Thr Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
625 630 635 640

Cys Ala Ser Tyr His Thr Val Ser Leu Leu Arg Ser Thr Ser Gln Lys
645 650 655

Ser Ile Val Ala Tyr Thr Met Ser Leu Gly Ala Asp Ser Ser Ile Ala
660 665 670

Tyr Ser Asn Asn Thr Ile Ala Ile Pro Thr Asn Phe Ser Ile Ser Ile
675 680 685

Thr Thr Glu Val Met Pro Val Ser Met Ala Lys Thr Ser Val Asp Cys
690 695 700

Asn Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ala Asn Leu Leu Leu
705 710 715 720

Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Ser Gly Ile
725 730 735

Ala Ala Glu Gln Asp Arg Asn Thr Arg Glu Val Phe Ala Gln Val Lys
740 745 750

Gln Met Tyr Lys Thr Pro Thr Leu Lys Tyr Phe Gly Gly Phe Asn Phe
755 760 765

Ser Gln Ile Leu Pro Asp Pro Leu Lys Pro Thr Lys Arg Ser Phe Ile
770 775 780

Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly Phe Met
785 790 795 800

Lys Gln Tyr Gly Glu Cys Leu Gly Asp Ile Asn Ala Arg Asp Leu Ile
805 810 815

Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu Leu Thr
820 825 830

Asp Asp Met Ile Ala Ala Tyr Thr Ala Ala Leu Val Ser Gly Thr Ala
835 840 845

Thr Ala Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile Pro Phe
850 855 860

Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln Asn
865 870 875 880

Val Leu Tyr Glu Asn Gln Lys Gln Ile Ala Asn Gln Phe Asn Lys Ala
885 890 895

Ile Ser Gln Ile Gln Glu Ser Leu Thr Thr Thr Ser Thr Ala Leu Gly
900 905 910

Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu
915 920 925

Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn
930 935 940

Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln Ile Asp
945 950 955 960

Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr Gln
965 970 975

Gln Leu Ile Arg Ala Ala Glu Ile
980



<210> 49 
<211> 1174 
<212> PRT 
<213> SARS coronavirus



<400> 49 

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
1 5 10 15

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
20 25 30

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
35 40 45

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
50 55 60

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
65 70 75 80

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
85 90 95

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
100 105 110

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
115 120 125

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
130 135 140

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
145 150 155 160

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
165 170 175

Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
180 185 190

Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
195 200 205

Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
210 215 220

Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
225 230 235 240

Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
245 250 255

Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys Cys
260 265 270

Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser Asn
275 280 285

Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr
290 295 300

Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
305 310 315 320

Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
325 330 335

Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
340 345 350

Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
355 360 365

Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
370 375 380

Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
385 390 395 400

Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
405 410 415

Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu
420 425 430

Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly
435 440 445

Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp
450 455 460

Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val
465 470 475 480

Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly
485 490 495

Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn
500 505 510

Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Pro Ser Ser Lys Arg
515 520 525

Phe Gln Pro Phe Gln Gln Phe Gly Arg Asp Val Ser Asp Phe Thr Asp
530 535 540

Ser Val Arg Asp Pro Lys Thr Ser Glu Ile Leu Asp Ile Ser Pro Cys
545 550 555 560

Ala Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Ala Ser Ser
565 570 575

Glu Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Asp Val Ser Thr
580 585 590

Ala Ile His Ala Asp Gln Leu Thr Pro Ala Trp Arg Ile Tyr Ser Thr
595 600 605

Gly Asn Asn Val Phe Gln Thr Gln Ala Gly Cys Leu Ile Gly Ala Glu
610 615 620

His Val Asp Thr Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
625 630 635 640

Cys Ala Ser Tyr His Thr Val Ser Leu Leu Arg Ser Thr Ser Gln Lys
645 650 655

Ser Ile Val Ala Tyr Thr Met Ser Leu Gly Ala Asp Ser Ser Ile Ala
660 665 670

Tyr Ser Asn Asn Thr Ile Ala Ile Pro Thr Asn Phe Ser Ile Ser Ile
675 680 685

Thr Thr Glu Val Met Pro Val Ser Met Ala Lys Thr Ser Val Asp Cys
690 695 700

Asn Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ala Asn Leu Leu Leu
705 710 715 720

Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Ser Gly Ile
725 730 735

Ala Ala Glu Gln Asp Arg Asn Thr Arg Glu Val Phe Ala Gln Val Lys
740 745 750

Gln Met Tyr Lys Thr Pro Thr Leu Lys Tyr Phe Gly Gly Phe Asn Phe
755 760 765

Ser Gln Ile Leu Pro Asp Pro Leu Lys Pro Thr Lys Arg Ser Phe Ile
770 775 780

Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly Phe Met
785 790 795 800

Lys Gln Tyr Gly Glu Cys Leu Gly Asp Ile Asn Ala Arg Asp Leu Ile
805 810 815

Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu Leu Thr
820 825 830

Asp Asp Met Ile Ala Ala Tyr Thr Ala Ala Leu Val Ser Gly Thr Ala
835 840 845

Thr Ala Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile Pro Phe
850 855 860

Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln Asn
865 870 875 880

Val Leu Tyr Glu Asn Gln Lys Gln Ile Ala Asn Gln Phe Asn Lys Ala
885 890 895

Ile Ser Gln Ile Gln Glu Ser Leu Thr Thr Thr Ser Thr Ala Leu Gly
900 905 910

Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu
915 920 925

Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn
930 935 940

Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln Ile Asp
945 950 955 960

Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr Gln
965 970 975

Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu Ala Ala
980 985 990

Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val Asp Phe
995 1000 1005

Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ala Ala Pro His
1010 1015 1020

Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ser Gln Glu Arg Asn
1025 1030 1035 1040

Phe Thr Thr Ala Pro Ala Ile Cys His Glu Gly Lys Ala Tyr Phe Pro
1045 1050 1055

Arg Glu Gly Val Phe Val Phe Asn Gly Thr Ser Trp Phe Ile Thr Gln
1060 1065 1070

Arg Asn Phe Phe Ser Pro Gln Ile Ile Thr Thr Asp Asn Thr Phe Val
1075 1080 1085

Ser Gly Asn Cys Asp Val Val Ile Gly Ile Ile Asn Asn Thr Val Tyr
1090 1095 1100

Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys
1105 1110 1115 1120

Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser
1125 1130 1135

Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu
1140 1145 1150

Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu
1155 1160 1165

Leu Gly Lys Tyr Glu Gln
1170




<210> 50 
<211> 260 
<212> PRT 

<213> SARS coronavirus 


<400> 50 

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
1 5 10 15

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
20 25 30

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
35 40 45

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
50 55 60

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
65 70 75 80

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
85 90 95

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
100 105 110

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
115 120 125

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
130 135 140

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
145 150 155 160

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
165 170 175

Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
180 185 190

Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
195 200 205

Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
210 215 220

Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
225 230 235 240

Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
245 250 255

Thr Asp Ala Val
260

<210> 51
<211> 430 
<212> PRT 

<213> SARS coronavirus 


<400> 51 

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
1 5 10 15

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
20 25 30

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
35 40 45

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
50 55 60

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
65 70 75 80

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
85 90 95

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
100 105 110

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
115 120 125

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
130 135 140

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
145 150 155 160

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
165 170 175

Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
180 185 190

Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
195 200 205

Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
210 215 220

Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
225 230 235 240

Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
245 250 255

Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys Cys
260 265 270

Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser Asn
275 280 285

Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr
290 295 300

Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
305 310 315 320

Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
325 330 335

Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
340 345 350

Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
355 360 365

Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
370 375 380

Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
385 390 395 400

Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
405 410 415

Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly
420 425 430



<210> 52
<211> 521
<212> PRT

<213> SARS coronavirus

<400> 52

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
1 5 10 15

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
20 25 30

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
35 40 45

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
50 55 60

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
65 70 75 80

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
85 90 95

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
100 105 110

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
115 120 125

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
130 135 140

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
145 150 155 160

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
165 170 175

Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
180 185 190

Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
195 200 205

Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
210 215 220

Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
225 230 235 240

Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
245 250 255

Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys Cys
260 265 270

Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser Asn
275 280 285

Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr
290 295 300

Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
305 310 315 320

Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
325 330 335

Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
340 345 350

Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
355 360 365

Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
370 375 380

Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
385 390 395 400

Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
405 410 415

Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu
420 425 430

Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly
435 440 445

Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp
450 455 460

Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val
465 470 475 480

Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly
485 490 495

Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn
500 505 510

Phe Asn Gly Leu Thr Gly Thr Gly Val
515 520

<210> 53
<211> 777
<212> PRT

<213> Artificial Sequence

<220>

<223> Synthetic sequence of amino acids 17-757 of SEQ ID NO:1 plus an
N-terminal mouse κ chain leader sequence and a C-terminal myc
epitope and a polyhistidine tag

<400> 53

Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro
1 5 10 15

Gly Ser Thr Gly Asp Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala
20 25 30

Pro Asn Tyr Thr Gln His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro
35 40 45

Asp Glu Ile Phe Arg Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe
50 55 60

Leu Pro Phe Tyr Ser Asn Val Thr Gly Phe His Thr Ile Asn His Thr
65 70 75 80

Phe Gly Asn Pro Val Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala
85 90 95

Thr Glu Lys Ser Asn Val Val Arg Gly Trp Val Phe Gly Ser Thr Met
100 105 110

Asn Asn Lys Ser Gln Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val
115 120 125

Val Ile Arg Ala Cys Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala
130 135 140

Val Ser Lys Pro Met Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn
145 150 155 160

Ala Phe Asn Cys Thr Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp
165 170 175

Val Ser Glu Lys Ser Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe
180 185 190

Lys Asn Lys Asp Gly Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile
195 200 205

Asp Val Val Arg Asp Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile
210 215 220

Phe Lys Leu Pro Leu Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu
225 230 235 240

Thr Ala Phe Ser Pro Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala
245 250 255

Tyr Phe Val Gly Tyr Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp
260 265 270

Glu Asn Gly Thr Ile Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu
275 280 285

Ala Glu Leu Lys Cys Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile
290 295 300

Tyr Gln Thr Ser Asn Phe Arg Val Val Pro Ser Gly Asp Val Val Arg
305 310 315 320

Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala
325 330 335

Thr Lys Phe Pro Ser Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn
340 345 350

Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr
355 360 365

Phe Lys Cys Tyr Gly Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe
370 375 380

Ser Asn Val Tyr Ala Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg
385 390 395 400

Gln Ile Ala Pro Gly Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys
405 410 415

Leu Pro Asp Asp Phe Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn
420 425 430

Ile Asp Ala Thr Ser Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu
435 440 445

Arg His Gly Lys Leu Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro
450 455 460

Phe Ser Pro Asp Gly Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr
465 470 475 480

Trp Pro Leu Asn Asp Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr
485 490 495

Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro
500 505 510

Ala Thr Val Cys Gly Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln
515 520 525

Cys Val Asn Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr
530 535 540

Pro Ser Ser Lys Arg Phe Gln Pro Phe Gln Gln Phe Gly Arg Asp Val
545 550 555 560

Ser Asp Phe Thr Asp Ser Val Arg Asp Pro Lys Thr Ser Glu Ile Leu
565 570 575

Asp Ile Ser Pro Cys Ala Phe Gly Gly Val Ser Val Ile Thr Pro Gly
580 585 590

Thr Asn Ala Ser Ser Glu Val Ala Val Leu Tyr Gln Asp Val Asn Cys
595 600 605

Thr Asp Val Ser Thr Ala Ile His Ala Asp Gln Leu Thr Pro Ala Trp
610 615 620

Arg Ile Tyr Ser Thr Gly Asn Asn Val Phe Gln Thr Gln Ala Gly Cys
625 630 635 640

Leu Ile Gly Ala Glu His Val Asp Thr Ser Tyr Glu Cys Asp Ile Pro
645 650 655

Ile Gly Ala Gly Ile Cys Ala Ser Tyr His Thr Val Ser Leu Leu Arg
660 665 670

Ser Thr Ser Gln Lys Ser Ile Val Ala Tyr Thr Met Ser Leu Gly Ala
675 680 685

Asp Ser Ser Ile Ala Tyr Ser Asn Asn Thr Ile Ala Ile Pro Thr Asn
690 695 700

Phe Ser Ile Ser Ile Thr Thr Glu Val Met Pro Val Ser Met Ala Lys
705 710 715 720

Thr Ser Val Asp Cys Asn Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys
725 730 735

Ala Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg
740 745 750

Ala Leu Ser Gly Ile Ala Ala Glu Gln Glu Gln Lys Leu Ile Ser Glu
755 760 765

Glu Asp Leu His His His His His His
770 775

<210> 54 

<211> 297 
<212> PRT 

<213> Artificial Sequence 

<220>

<223> Synthetic sequence of amino acids 17-276 of SEQ ID NO:1 plus an
N-terminal mouse κ chain leader sequence and a C-terminal myc
epitope and a polyhistidine tag

<400> 54 

Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro
1 5 10 15

Gly Ser Thr Gly Asp Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala
20 25 30

Pro Asn Tyr Thr Gln His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro
35 40 45

Asp Glu Ile Phe Arg Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe
50 55 60

Leu Pro Phe Tyr Ser Asn Val Thr Gly Phe His Thr Ile Asn His Thr
65 70 75 80

Phe Gly Asn Pro Val Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala
85 90 95

Thr Glu Lys Ser Asn Val Val Arg Gly Trp Val Phe Gly Ser Thr Met
100 105 110

Asn Asn Lys Ser Gln Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val
115 120 125

Val Ile Arg Ala Cys Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala
130 135 140

Val Ser Lys Pro Met Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn
145 150 155 160

Ala Phe Asn Cys Thr Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp
165 170 175

Val Ser Glu Lys Ser Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe
180 185 190

Lys Asn Lys Asp Gly Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile
195 200 205

Asp Val Val Arg Asp Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile
210 215 220

Phe Lys Leu Pro Leu Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu
225 230 235 240

Thr Ala Phe Ser Pro Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala
245 250 255

Tyr Phe Val Gly Tyr Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp
260 265 270

Glu Asn Gly Thr Ile Thr Asp Ala Val Glu Gln Lys Leu Ile Ser Glu
275 280 285

Glu Asp Leu His His His His His His
290 295

<210> 55
<211> 558 
<212> PRT 

<213> Artificial Sequence 

<220>

<223> A synthetic sequence of amino acids 17-537 of SEQ ID NO:1 plus an
N-terminal mouse κ chain leader sequence and a C-terminal myc
epitope and a polyhistidine tag

<400> 55 

Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro
1 5 10 15

Gly Ser Thr Gly Asp Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala
20 25 30

Pro Asn Tyr Thr Gln His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro
35 40 45

Asp Glu Ile Phe Arg Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe
50 55 60

Leu Pro Phe Tyr Ser Asn Val Thr Gly Phe His Thr Ile Asn His Thr
65 70 75 80

Phe Gly Asn Pro Val Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala
85 90 95

Thr Glu Lys Ser Asn Val Val Arg Gly Trp Val Phe Gly Ser Thr Met
100 105 110

Asn Asn Lys Ser Gln Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val
115 120 125

Val Ile Arg Ala Cys Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala
130 135 140

Val Ser Lys Pro Met Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn
145 150 155 160

Ala Phe Asn Cys Thr Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp
165 170 175

Val Ser Glu Lys Ser Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe
180 185 190

Lys Asn Lys Asp Gly Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile
195 200 205

Asp Val Val Arg Asp Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile
210 215 220

Phe Lys Leu Pro Leu Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu
225 230 235 240

Thr Ala Phe Ser Pro Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala
245 250 255

Tyr Phe Val Gly Tyr Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp
260 265 270

Glu Asn Gly Thr Ile Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu
275 280 285

Ala Glu Leu Lys Cys Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile
290 295 300

Tyr Gln Thr Ser Asn Phe Arg Val Val Pro Ser Gly Asp Val Val Arg
305 310 315 320

Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala
325 330 335

Thr Lys Phe Pro Ser Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn
340 345 350

Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr
355 360 365

Phe Lys Cys Tyr Gly Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe
370 375 380

Ser Asn Val Tyr Ala Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg
385 390 395 400

Gln Ile Ala Pro Gly Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys
405 410 415

Leu Pro Asp Asp Phe Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn
420 425 430

Ile Asp Ala Thr Ser Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu
435 440 445

Arg His Gly Lys Leu Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro
450 455 460

Phe Ser Pro Asp Gly Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr
465 470 475 480

Trp Pro Leu Asn Asp Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr
485 490 495

Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro
500 505 510

Ala Thr Val Cys Gly Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln
515 520 525

Cys Val Asn Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Glu Gln
530 535 540

Lys Leu Ile Ser Glu Glu Asp Leu His His His His His His
545 550 555

<210> 56
<211> 739 
<212> PRT 

<213> Artificial Sequence 

5 

<220> 

<223> A synthetic sequence of amino acids 17-7 56 of SEQ ID NO:l 
without a signal peptide at the N-terminus 

10<400> 56 

Asp Arg Cys Thr Thr Phe Asp Asp Val Gln Ala Pro Asn Tyr Thr Gln
1 5 10 15

His Thr Ser Ser Met Arg Gly Val Tyr Tyr Pro Asp Glu Ile Phe Arg
20 25 30

Ser Asp Thr Leu Tyr Leu Thr Gln Asp Leu Phe Leu Pro Phe Tyr Ser
35 40 45

Asn Val Thr Gly Phe His Thr Ile Asn His Thr Phe Gly Asn Pro Val
50 55 60

Ile Pro Phe Lys Asp Gly Ile Tyr Phe Ala Ala Thr Glu Lys Ser Asn
65 70 75 80

Val Val Arg Gly Trp Val Phe Gly Ser Thr Met Asn Asn Lys Ser Gln
85 90 95

Ser Val Ile Ile Ile Asn Asn Ser Thr Asn Val Val Ile Arg Ala Cys
100 105 110

Asn Phe Glu Leu Cys Asp Asn Pro Phe Phe Ala Val Ser Lys Pro Met
115 120 125

Gly Thr Gln Thr His Thr Met Ile Phe Asp Asn Ala Phe Asn Cys Thr
130 135 140

Phe Glu Tyr Ile Ser Asp Ala Phe Ser Leu Asp Val Ser Glu Lys Ser
145 150 155 160

Gly Asn Phe Lys His Leu Arg Glu Phe Val Phe Lys Asn Lys Asp Gly
165 170 175

Phe Leu Tyr Val Tyr Lys Gly Tyr Gln Pro Ile Asp Val Val Arg Asp
180 185 190

Leu Pro Ser Gly Phe Asn Thr Leu Lys Pro Ile Phe Lys Leu Pro Leu
195 200 205

Gly Ile Asn Ile Thr Asn Phe Arg Ala Ile Leu Thr Ala Phe Ser Pro
210 215 220

Ala Gln Asp Ile Trp Gly Thr Ser Ala Ala Ala Tyr Phe Val Gly Tyr
225 230 235 240

Leu Lys Pro Thr Thr Phe Met Leu Lys Tyr Asp Glu Asn Gly Thr Ile
245 250 255

Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys Cys
260 265 270

Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser Asn
275 280 285

Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn Ile Thr
290 295 300

Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
305 310 315 320

Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
325 330 335

Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
340 345 350

Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
355 360 365

Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
370 375 380

Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
385 390 395 400

Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
405 410 415

Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu
420 425 430

Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly
435 440 445

Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp
450 455 460

Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val
465 470 475 480

Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly
485 490 495

Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn
500 505 510

Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Pro Ser Ser Lys Arg
515 520 525

Phe Gln Pro Phe Gln Gln Phe Gly Arg Asp Val Ser Asp Phe Thr Asp
530 535 540

Ser Val Arg Asp Pro Lys Thr Ser Glu Ile Leu Asp Ile Ser Pro Cys
545 550 555 560

Ala Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Ala Ser Ser
565 570 575

Glu Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Asp Val Ser Thr
580 585 590

Ala Ile His Ala Asp Gln Leu Thr Pro Ala Trp Arg Ile Tyr Ser Thr
595 600 605

Gly Asn Asn Val Phe Gln Thr Gln Ala Gly Cys Leu Ile Gly Ala Glu
610 615 620

His Val Asp Thr Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
625 630 635 640

Cys Ala Ser Tyr His Thr Val Ser Leu Leu Arg Ser Thr Ser Gln Lys
645 650 655

Ser Ile Val Ala Tyr Thr Met Ser Leu Gly Ala Asp Ser Ser Ile Ala
660 665 670

Tyr Ser Asn Asn Thr Ile Ala Ile Pro Thr Asn Phe Ser Ile Ser Ile
675 680 685

Thr Thr Glu Val Met Pro Val Ser Met Ala Lys Thr Ser Val Asp Cys
690 695 700

Asn Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ala Asn Leu Leu Leu
705 710 715 720

Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Ser Gly Ile
725 730 735

Ala Ala Glu



<210> 57 
<211> 265 
<212> PRT 
<213> Artificial Sequence

<220>
<223> A synthetic sequence of amino acids 272-537 of SEQ ID NO:1

<400> 57

Ile Thr Asp Ala Val Asp Cys Ser Gln Asn Pro Leu Ala Glu Leu Lys
1 5 10 15

Cys Ser Val Lys Ser Phe Glu Ile Asp Lys Gly Ile Tyr Gln Thr Ser
20 25 30

Asn Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Asn Ile Thr
35 40 45

Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe Pro Ser
50 55 60

Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala Asp Tyr
65 70 75 80

Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys Tyr Gly
85 90 95

Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val Tyr Ala
100 105 110

Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala Pro Gly
115 120 125

Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe
130 135 140

Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala Thr Ser
145 150 155 160

Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly Lys Leu
165 170 175

Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro Asp Gly
180 185 190

Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu Asn Asp
195 200 205

Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr Arg Val
210 215 220

Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val Cys Gly
225 230 235 240

Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn Phe Asn
245 250 255

Phe Asn Gly Leu Thr Gly Thr Gly Val
260 265



<210> 58 
<211> 17
<212> PRT
<213> SARS coronavirus

<400> 58

Asp Val Gln Ala Pro Asn Tyr Thr Gln His Thr Ser Ser Met Arg Gly
1 5 10 15

Cys

<210> 59
<211> 15
<212> PRT
<213> SARS coronavirus

<400> 59

Pro Ser Ser Lys Arg Phe Gln Pro Gln Gln Phe Gly Arg Asp Cys
1 5 10 15

<210> 60
<211> 16
<212> PRT
<213> SARS coronavirus

<400> 60

Met Phe Ile Phe Leu Leu Phe Leu Thr Leu Thr Ser Gly Ser Asp Leu
1 5 10 15

<210> 61
<211> 235
<212> PRT
<213> Artificial Sequence

<220>
<223> A synthetic sequence of amino acids 303-537 of SEQ ID NO:1
containing the receptor binding domain

<400> 61 

Ser Asn Phe Arg Val Val Pro Ser Gly Asp Val Val Arg Phe Pro Asn
1 5 10 15

Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe
20 25 30

Pro Ser Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala
35 40 45

Asp Tyr Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys
50 55 60

Tyr Gly Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val
65 70 75 80

Tyr Ala Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala
85 90 95

Pro Gly Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp
100 105 110

Asp Phe Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala
115 120 125

Thr Ser Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly
130 135 140

Lys Leu Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro
145 150 155 160

Asp Gly Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu
165 170 175

Asn Asp Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr
180 185 190

Arg Val Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val
195 200 205

Cys Gly Pro Lys Leu Ser Thr Asp Leu Ile Lys Asn Gln Cys Val Asn
210 215 220

Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val
225 230 235

<210> 62
<211> 199
<212> PRT
<213> Artificial Sequence

<220>
<223> A synthetic sequence of amino acids 319-517 of SEQ ID NO:1
containing the receptor binding domain

<400> 62 

Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe
1 5 10 15

Pro Ser Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala
20 25 30

Asp Tyr Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys
35 40 45

Tyr Gly Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val
50 55 60

Tyr Ala Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala
65 70 75 80

Pro Gly Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp
85 90 95

Asp Phe Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala
100 105 110

Thr Ser Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly
115 120 125

Lys Leu Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro
130 135 140

Asp Gly Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu
145 150 155 160

Asn Asp Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr
165 170 175

Arg Val Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val
180 185 190

Cys Gly Pro Lys Leu Ser Thr
195

<210> 63
<211> 200
<212> PRT
<213> Artificial Sequence

<220>
<223> A synthetic sequence of amino acids 319-518 of SEQ ID NO:1
containing the receptor binding domain

<400> 63

Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Lys Phe
1 5 10 15

Pro Ser Val Tyr Ala Trp Glu Arg Lys Lys Ile Ser Asn Cys Val Ala
20 25 30

Asp Tyr Ser Val Leu Tyr Asn Ser Thr Phe Phe Ser Thr Phe Lys Cys
35 40 45

Tyr Gly Val Ser Ala Thr Lys Leu Asn Asp Leu Cys Phe Ser Asn Val
50 55 60

Tyr Ala Asp Ser Phe Val Val Lys Gly Asp Asp Val Arg Gln Ile Ala
65 70 75 80

Pro Gly Gln Thr Gly Val Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp
85 90 95

Asp Phe Met Gly Cys Val Leu Ala Trp Asn Thr Arg Asn Ile Asp Ala
100 105 110

Thr Ser Thr Gly Asn Tyr Asn Tyr Lys Tyr Arg Tyr Leu Arg His Gly
115 120 125

Lys Leu Arg Pro Phe Glu Arg Asp Ile Ser Asn Val Pro Phe Ser Pro
130 135 140

Asp Gly Lys Pro Cys Thr Pro Pro Ala Leu Asn Cys Tyr Trp Pro Leu
145 150 155 160

Asn Asp Tyr Gly Phe Tyr Thr Thr Thr Gly Ile Gly Tyr Gln Pro Tyr
165 170 175

Arg Val Val Val Leu Ser Phe Glu Leu Leu Asn Ala Pro Ala Thr Val
180 185 190

Cys Gly Pro Lys Leu Ser Thr Asp
195 200

<210> 64
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> A synthetic primer

<400> 64
gatcggatcc ggtacaatca cag 23

<210> 65
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> A synthetic primer

<400> 65
gatcgggccc gacacactgg ttc 23



INTERNATIONAL SEARCH REPORT

International Application No
PCT/US2004/023345

A. CLASSIFICATION OF SUBJECT MATTER
IPC 7 C07K14/165 A61K39/215 A61K39/42 A61K38/16

According to International Patent Classification (IPC) or to both national classification and IPC

B. FIELDS SEARCHED

Minimum documentation searched (classification system followed by classification symbols)
IPC 7 C07K A61K

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched

Electronic data base consulted during the international search (name of data base and, where practical, search terms used)
EPO-Internal, BIOSIS, WPI Data, EMBASE, MEDLINE, PAJ, Sequence Search

C. DOCUMENTS CONSIDERED TO BE RELEVANT

Category * Citation of document, with indication, where appropriate, of the relevant passages

DATABASE EMBL 23 April 2003 (2003-04-23),
"SARS coronavirus Urbani, complete genome."
XP002304795
retrieved from EBI
Database accession no. AY278741
abstract

-& ROTA P A ET AL: "Characterization of a novel coronavirus associated with severe acute respiratory syndrome"
SCIENCE, AMERICAN ASSOCIATION FOR THE ADVANCEMENT OF SCIENCE, US,
vol. 300, no. 5624,
30 May 2003 (2003-05-30), pages 1394-1399,
XP002269482
ISSN: 0036-8075
the whole document

Relevant to claim No.

1-84

1-84

-/--



Further documents are listed in the continuation of box C.

Patent family members are listed in annex.

* Special categories of cited documents:

"A" document defining the general state of the art which is not considered to be of particular relevance

"E" earlier document but published on or after the international filing date

"L" document which may throw doubts on priority claim(s) or which is cited to establish the publication date of another citation or other special reason (as specified)

"O" document referring to an oral disclosure, use, exhibition or other means

"P" document published prior to the international filing date but later than the priority date claimed

"T" later document published after the international filing date or priority date and not in conflict with the application but cited to understand the principle or theory underlying the invention

"X" document of particular relevance; the claimed invention cannot be considered novel or cannot be considered to involve an inventive step when the document is taken alone

"Y" document of particular relevance; the claimed invention cannot be considered to involve an inventive step when the document is combined with one or more other such documents, such combination being obvious to a person skilled in the art.

"&" document member of the same patent family

Date of the actual completion of the international search
10 November 2004

Date of mailing of the international search report
26/11/2004

Name and mailing address of the ISA
European Patent Office, P.B. 5818 Patentlaan 2
NL-2280 HV Rijswijk
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl,
Fax: (+31-70) 340-3016

Authorized officer
Grotzinger, T

Form PCT/ISA/210 (second sheet) (January 2004)



C. (Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT

Category

P,X

P,X

Citation of document, with indication, where appropriate, of the relevant passages

DATABASE EMBL 15 April 2003 (2003-04-15),
"SARS coronavirus TOR2, complete genome."
XP002304796
retrieved from EBI
Database accession no. AY274119
abstract

MARRA MA ET AL: "The genome sequence of the SARS-associated coronavirus"
SCIENCE, AMERICAN ASSOCIATION FOR THE ADVANCEMENT OF SCIENCE, US,
vol. 300, no. 5624,
30 May 2003 (2003-05-30), pages 1399-1404,
XP002269483
ISSN: 0036-8075
the whole document

WO 93/23421 A (SMITHKLINE BEECHAM CORP; JONES ELAINE V (US); KLEPFER SHARON (US); MI)
25 November 1993 (1993-11-25)
page 3, line 24 - line 31
page 4, line 6 - line 30

DATABASE EMBL 30 November 2003 (2003-11-30),
"SARS coronavirus CUHK-AG01, complete genome."
XP002304797
retrieved from EBI
Database accession no. AY345986
abstract

-& CHIM S ET AL: "Genomic characterisation of the severe acute respiratory syndrome coronavirus of Amoy Gardens outbreak in Hong Kong"
LANCET THE, LANCET LIMITED. LONDON, GB,
vol. 362, no. 9398,
29 November 2003 (2003-11-29), pages 1807-1808,
XP004476558
ISSN: 0140-6736
the whole document

DATABASE EMBL 7 January 2004 (2004-01-07),
"SARS coronavirus TW6, complete genome."
XP002304798
retrieved from EBI
Database accession no. AY502929
abstract

Relevant to claim No.

1-84

1-84

39,40, 44-48, 59-61, 57-69

1-84

1-84

1-84

Form PCT/ISA/210 (continuation of second sheet) (January 2004)



C. (Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT

Category

P,X

Citation of document, with indication, where appropriate, of the relevant passages

YEH SHIOU-HWEI ET AL: "Characterization of severe acute respiratory syndrome coronavirus genomes in Taiwan: Molecular epidemiology and genome evolution."
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA,
vol. 101, no. 8,
24 February 2004 (2004-02-24), pages 2542-2547,
XP002304793
ISSN: 0027-8424
the whole document

DATABASE EMBL 28 February 2004 (2004-02-28),
"SARS coronavirus TW-GD5 isolate TW-GD5_SC22-23 replicase 1B and spike glycoprotein genes, partial cds."
XP002304799
retrieved from EBI
Database accession no. AY451903
abstract

YANG ZHI-YONG ET AL: "A DNA vaccine induces SARS coronavirus neutralization and protective immunity in mice"
NATURE (LONDON),
vol. 428, no. 6982,
1 April 2004 (2004-04-01), pages 561-564,
XP002304794
ISSN: 0028-0836
abstract
page 563, right-hand column, section "Immunogen and plasmid construction"

Relevant to claim No.

1-84

1,4-20, 24-30, 37-48, 67-69, 73,74

1-84

Form PCT/ISA/210 (continuation of second sheet) (January 2004)



Box No. I Nucleotide and/or amino acid sequence(s) (Continuation of item 1.b of the first sheet)

1. With regard to any nucleotide and/or amino acid sequence disclosed in the international application and necessary to the claimed invention, the international search was carried out on the basis of:

a. type of material
a sequence listing
table(s) related to the sequence listing

b. format of material
in written format
in computer readable form

c. time of filing/furnishing
contained in the international application as filed
filed together with the international application in computer readable form
furnished subsequently to this Authority for the purpose of search

2. In addition, in the case that more than one version or copy of a sequence listing and/or table relating thereto has been filed or furnished, the required statements that the information in the subsequent or additional copies is identical to that in the application as filed or does not go beyond the application as filed, as appropriate, were furnished.

3. Additional comments:

Form PCT/ISA/210 (continuation of first sheet (1)) (January 2004)



Box II Observations where certain claims were found unsearchable (Continuation of item 2 of first sheet)

This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons:

1. [x] Claims Nos.:
because they relate to subject matter not required to be searched by this Authority, namely:

Although claims 59, 62, 66, 67, as well as the dependent claims are directed to a method of treatment of the human/animal body, the search has been carried out and based on the alleged effects of the compound/composition.

2. [ ] Claims Nos.:
because they relate to parts of the International Application that do not comply with the prescribed requirements to such an extent that no meaningful International Search can be carried out, specifically:

3. [ ] Claims Nos.:
because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a).

Box III Observations where unity of invention is lacking (Continuation of item 3 of first sheet)

This International Searching Authority found multiple inventions in this international application, as follows:

1. [ ] As all required additional search fees were timely paid by the applicant, this International Search Report covers all searchable claims.

2. As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment of any additional fee.

3. [ ] As only some of the required additional search fees were timely paid by the applicant, this International Search Report covers only those claims for which fees were paid, specifically claims Nos.:

4. No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is restricted to the invention first mentioned in the claims; it is covered by claims Nos.:

Remark on Protest
[ ] The additional search fees were accompanied by the applicant's protest.
[ ] No protest accompanied the payment of additional search fees.

Form PCT/ISA/210 (continuation of first sheet (2)) (January 2004)



Information on patent family members

Patent document           Publication     Patent family              Publication
cited in search report    date            member(s)                  date

WO 9323421                25-11-1993      AU  [illegible]            19-06-1997
                                          AU  4240493      A         13-12-1993
                                          AU  678971       B2        19-06-1997
                                          AU  4241093      A         13-12-1993
                                          CA  2134898      A1        25-11-1993
                                          CA  2135201      A1        25-11-1993
                                          EP  0640096      A1        01-03-1995
                                          EP  0640097      A1        01-03-1995
                                          JP  7508176      T         14-09-1995
                                          JP  8501931      T         05-03-1996
                                          WO  9323421      A1        25-11-1993
                                          WO  9323422      A1        25-11-1993

Form PCT/ISA/210 (patent family annex) (January 2004)



